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To my parents 



PREFACE 



My aim in this book is to give an elementary treatment of linear control 
theory with an H M optimality criterion. The systems are all linear, time- 
invariant, and finite-dimensional and they operate in continuous time. The book 
has been used in a one-semester graduate course, with only a few prerequisites: 
classical control theory, linear systems (state-space and input-output viewpoints), 
and a bit of real and complex analysis. 

Only one problem is solved in this book: how to design a controller which 
minimizes the H^-norm of a pre-designated closed-loop transfer matrix. The 
Hoo-norm of a transfer matrix is the maximum over all frequencies of its largest 
singular value. In this problem the plant is fixed and known, although a certain 
robust stabilization problem can be recast in this form. The general robust per- 
formance problem - how to design a controller which is H^-optimal for the worst 
plant in a pre-specified set - is as yet unsolved. 

The book focuses on the mathematics of H^, control. Generally speaking, 
the theory is developed in the input-output (operator) framework, while computa- 
tional procedures are presented in the state-space framework. However, I have 
compromised in some proofs: if a result is required for computations and if both 
operator and state-space proofs are available, I have usually adopted the latter. 
The book contains several numerical examples, which were performed using PC- 
MATLAB and the Control Systems Toolbox. The primary purpose of the exam- 
ples is to illustrate the theory, although two are examples of (not entirely realis- 
ts) Hqq designs. A good project for the future would be a collection of case stu- 
dies of H^ designs. 

Chapter 1 motivates the approach by looking at two example control prob- 
lems: robust stabilization and wideband disturbance attenuation. Chapter 2 col- 
lects some elementary concepts and facts concerning spaces of functions, both 
time-domain and frequency domain. Then the main problem, called the standard 
problem, is posed in Chapter 3. One example of the standard problem is the 
model-matching problem of designing a cascade controller to minimize the error 



VI 

between the input-output response of a plant and that of a model. In Chapter 4 
the very useful parametrization due to Youla, Jabr, and Bongiorno (1976) is used 
to reduce the standard problem to the model-matching problem. The results in 
Chapter 4 are fairly routine generalizations of those in the expert book by 
Vidyasagar (1985a). 

Chapter 5 introduces some basic concepts about operators on Hilbert space 
and presents some useful facts about Hankel operators, including Nehari's 
theorem. This material permits a solution to the scalar-valued model-matching 
problem in Chapter 6. The matrix-valued problem is much harder and requires a 
preliminary chapter, Chapter 7, on factorization theory. The basic factorization 
theorem is due to Bart, Gohberg, and Kaashoek (1979); its application yields 
spectral factorization, inner-outer factorization, and /-spectral factorization. 
This arsenal together with the geometric theory of Ball and Helton (1983) is used 
against the matrix-valued problem in Chapter 8; actually, only nearly optimal 
solutions are derived. 

Thus Chapters 4 to 8 constitute a theory of how to compute solutions to the 
standard problem. But the H,^ approach offers more than this: it yields qualita- 
tive and quantitative results on achievable performance, showing the trade-offs 
involved in frequency-domain design. Three examples of such results are 
presented in the final chapter. 

I chose to omit three elements of the theory: a proof of Nehari's theorem, 
because it would take us too far afield; a proof of the main existence theorem, for 
the same reason; and the theory of truly (rather than nearly) optimal solutions, 
because it's too hard for an elementary course. 

It is a pleasure to express my gratitude to three colleagues: George Zames, 
Bill Helton, and John Doyle. Because of George's creativity and enthusiasm I 
became interested in the subject in the first place. From Bill I learned some 
beautiful operator theory. And from John I learned "the big picture" and how to 
compute using state-space methods. I am also grateful to John for his invitation 
to participate in the ONR/Honeywell workshop (1984). The notes from that 
workshop led to a joint expository paper, which led in turn to this book. 
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I am also very grateful to Linda Espeut for typing the first draft into the 
computer and to John Hepburn for helping me with unix, troff, pic, and grap. 

Toronto Bruce A. Francis 

May, 1986 



SYMBOLS 



R 


field of real numbers 


C 


field of complex numbers 


L 2 (-oo, oo) 


time-domain Lebesgue space 


L 2 (-oo, 0] 


ditto 


L 2 [0, oo) 


ditto 


L 2 


frequency-domain Lebesgue space 


^00 


ditto 


H 2 


Hardy space 


H oo 


ditto 


prefix R 


real-rational 


ll-ll 


norm on C nXm ; maximum singular value 


IMI2 


norm on L 2 


11*1 loo 


norm on L^ 


superscript J_ 


orthogonal complement 


A T 


transpose of matrix A 


A* 


complex-conjugate transpose of matrix A 


$* 


adjoint of operator $ 


F~(s) 


F(-sf 


n- 


orthogonal projection from L 2 to H^- 


n 2 


orthogonal projection from L 2 to H 2 


i> 


Hankel operator with symbol F 


$x 


image of X under $ 


Im 


image 


Ker 


kernel 


X_(A) 


stable modal subspace relative to A 


X + (A) 


unstable modal subspace relative to A 



IX 

The transfer matrix corresponding to the state-space realization 
{A , B , C , D ) is denoted [A,B,C,D], i.e. 

[A,B,C,D]:=D + C (s -A y l B . 

Following is a collection of useful operations on transfer matrices using this data 
structure: 

[A,B,C,D] = [T~ l AT, T- l B, CT , D) 



[A,B, C ,D\~ l = [A-BD^CyBD-^-D^C^ 



[A,B,C,D]~ = M ' ,-C 1 ,B l ,D 



T n.T uT r>T] 



[A v B v C \, D JxfAg, B 2 , C 2 , D 2 \ 



A l B X C 2 




B X D 2 


A 2 
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. B * 


A 2 " 




' B 2 


B 1 C i A K 


■) 


B X D 2 



,[C 1 D l C 2 \,D 1 D 2 
,\D X C 2 C l ),D l D 2 



[A 1 ,B l , C^Dj-r- [A 2 ,B 2 ,C 2 ,D 2 ] 
\ B i 



A; 

A, 



B, 



,\C X C 2 ],D 1+ D 2 
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CHAPTER 1 
INTRODUCTION 



This course is about the design of control systems to meet frequency-domain 
performance specifications. This introduction presents two example problems by 
way of motivating the approach to be developed in the course. We shall restrict 
attention to single-input, single-output systems for simplicity. 

To begin, we need the Hardy space H^. This consists of all complex-valued 
functions F (s ) of a complex variable s which are analytic and bounded in the 
open right half-plane, Re s >0; bounded means that there is a real number 6 
such that 

| F{s) | < 6 , Re s >0 . 

The least such bound 6 is the R^-norm of F , denoted \\F H^. Equivalently 

||if || 00 :=sup{ | jF («)|:Re«>0}. (1) 

Let's focus on real- rational functions, i.e. rational functions with real 
coefficients. The subset of H^ consisting of real-rational functions will be 
denoted by RH^. If F(s) is real-rational, then F GRH^ if and only if F is 
proper ( | F (oo) | is finite) and stable (F has no poles in the closed right half- 
plane, Re s >0). By the maximum modulus theorem we can replace the open 
right half-plane in (1) by the imaginary axis: 

||F|| 0O =sup{| J F(ja;)|: W eR}. (2) 

To appreciate the concept of H^-norm in familiar terms, picture the Nyquist plot 
of F(s). Then (2) says that ||F H^, equals the distance from the origin to the 
farthest point on the Nyquist plot. 

We now look at two examples of control objectives which are characterizable 
as H^-norm constraints. 
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Figure 1.1. Single-loop feedback system 
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Example 1. 

The first example uses a baby version of the small gain theorem. Consider 
the feedback system in Figure 1. Here P (s ) and K (s ) are transfer functions 
and are assumed to be real-rational, proper, and stable. For well-posedness we 
shall assume that P or K (or both) is strictly proper (equal to zero at s =oo). 
The feedback system is said to be internally stable if the four transfer functions 
from v 1 and v 2 to u x and u 2 are all stable (they are all proper because of the 
assumptions on P and K ). For example, the transfer function from v 1 to u x 
equals (1-PK ) _1 . The Nyquist criterion says that the feedback system is inter- 
nally stable if and only if the Nyquist plot of PK doesn't pass through or encir- 
cle the point s =1. So a sufficient condition for internal stability is the small 
gain condition \\PK || 00 <1. 

Let's extend this idea to the problem of robust stabilization. The block 
diagram in Figure 2a shows a plant and controller with transfer functions 
P (s )+AP (s ) and K (s ) respectively; P represents the nominal plant and AP 
an unknown perturbation, usually due to unmodeled dynamics or parameter vari- 
ations. Suppose, for simplicity, that P , AP , and K are real-rational, P and 
AP are strictly proper and stable, and K is proper. Suppose also that the feed- 
back system is internally stable for AP =0. How large can | AP | be so that 
internal stability is maintained? 

One method which is used to obtain a transfer function model is a frequency 
response experiment. This yields gain and phase estimates at several frequencies, 
which in turn provide an upper bound for | AP (j uj) | at several values of u>. 
Suppose R is a radius function belonging to RH^ and bounding the perturba- 
tion AP in the sense that 

|AP(jw)| < \R(ju)\ for all0<w<oo , 
or equivalently 

|| J R- 1 AP|| 00 <1. (3) 

How large can \ R \ be so that internal stability is maintained? 
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Figure 1.2a. Feedback system with perturbed plant 
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Figure 1.2b. After loop transformation 
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Figure 1.2c. After loop transformation 
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Simple loop transformations lead from Figure 2a to Figure 2b to Figure 2c. 
Since the nominal feedback system is internally stable, K (1-PK ) -1 GRH 00 . Our 
baby version of the small gain theorem gives that the system in Figure 2c will be 
internally stable if 

iiAP/qi-i^m^i. ( 4 ) 

In view of (3) a sufficient condition for (4) is 

WRKil-PKy^Kl. (5) 

We just used the sub- multiplicative property of the H^-norm: 

We conclude that an H^-norm bound on a weighted closed-loop transfer func- 
tion, i.e. condition (5), is sufficient for robust stability. 



Example 2. 

For the second example we need another Hardy space, H 2 . It consists of all 
complex-valued functions F (s ) which are analytic in the open right half-plane 
and satisfy the condition 

l J /2 



sup (27T)- 1 / \F (Z+ju)\ 2 d< 

= —no 



< OO 



The left-hand side of this inequality is defined to be the H 2 -norm of F , \\F || 2 . 
Again, let's focus on real-rational functions. A real-rational function belongs to 
RH 2 if and only if it's stable and strictly proper. For such a function F (s ) it 
can be proved that its H 2 -norm can be obtained by integrating over the ima- 
ginary axis: 

r I 1 / 2 

OO 

= (27T)" 1 / \F{jw)\ 2 du . (6) 



\\F 



Consider a one-sided signal x (t ) (zero for t <0) and suppose its Laplace 
transform x {s ) belongs to RH 2 . Then Plancherel's theorem says 
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oo 

/ X{tfdt = \\x\\i . 



Thus \\x || 2 2 can be interpreted physically as the energy of the signal x (t ). 

Next, consider a system with transfer function F (s ) in RH^. Let the input 
and output signals be denoted by x (t ) and y (t ) respectively. It is easy to see 
that if i6RH 2 and ||£ || 2 =1, then y GRH 2 and ||y || 2 <||-f IL- Tlms the H oo" 
norm of the transfer function provides a bound on the system gain 

su p{||y II 2= £eRH 2 , ||£|| 2 =i}. 

The previous discussion was limited to the familiar class of real-rational 
functions, but the results are general. In fact the H^-norm of the transfer func- 
tion equals the system gain. The precise statement is as follows: If F GH^ and 
x £H 2 , then Fx £H 2 ; moreover 

|| J F|| 0O = sup{||Fi|| 2 : a5 eH 2 ,||x|| 2 =l}. (7) 

With these preliminaries let's look at a disturbance attenuation problem. In 
Figure 1 suppose v j^O and v 2 represents a disturbance signal referred to the 
output of the plant P . The objective is to attenuate the effect of v 2 on the out- 
put u 2 in a suitably defined sense. As before, we shall assume P and K are 
real-rational and proper, with at least one of them strictly proper. The transfer 
function from v 2 to « 2 is the sensitivity function 

S :== (1-PK Y 1 . 

We shall suppose the disturbance w 2 is not a fixed signal, but can be any 
function in the class 

{v 2 :i; 2 — W"x for some x £H 2 , \\x || 2 <1} , (8) 

where W ,W~ l £^i O0 ; that is, the disturbance signal class consists of all v 2 in H 2 
such that 

Il^-^ 2 ll2<l- ( 9 ) 

Assuming for now that the boundary values » 2 (i w ) an< i ^(i w ) are well-defined, 
we can interpret inequality (9) as a constraint on the weighted energy of v 2 : the 
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energy-density spectrum | t/ 2 (iw) | 2 is weighted by the factor | W (j w) \ " 2 . 
For example, if | W(j w) | were relatively large on a certain frequency band and 
relatively small off it, then (9) would generate a class of signals having their 
energy concentrated on that band. 

The disturbance attenuation objective can now be stated precisely: minimize 
the energy of « 2 f° r the worst v 2 in class (8); equivalently (by virtue of (7)), 
minimize \\WS H^, the H m -norm of the weighted sensitivity function. In a syn- 
thesis problem P and W would be given and K would be chosen to minimize 
|| T4^S Hoq, with the added constraint of internal stability. (In an actual design it 
may make more sense to employ W as a design parameter, to be adjusted by the 
designer to shape the magnitude Bode plot of S .) 

To recap, we have seen how certain control objectives, robust stability and 
disturbance attenuation, will be achieved if certain H^-norm bounds are 
achieved. In Chapter 3 is posed a general H M optimization problem which 
includes the above two examples as special cases. 

Notes and References 

The theory presented in this book was initiated by Zames (1976, 1979, 1981). 
He formulated the problem of sensitivity reduction by feedback as an optimiza- 
tion problem with an operator norm, in particular, an H^-norm. Relevant con- 
temporaneous works are those of Helton (1976) and Tannenbaum (1977). The 
important papers of Sarason (1967), and Adamjan, Arov, and Krein (1971) esta- 
blished connections between operator theory and complex function theory, in par- 
ticular, H^-functions; Helton showed that these two mathematical subjects have 
useful applications in electrical engineering, namely, in broadband matching. 
Tannenbaum used (Nevanlinna-Pick) interpolation theory to attack the problem 
of stabilizing a plant with an unknown gain. 

For a survey of the papers in the field the reader may consult Francis and 
Doyle (1986). 



CHAPTER 2 



BACKGROUND MATHEMATICS: FUNCTION SPACES 



The purpose of this chapter is to collect some elementary concepts and facts 
from functional analysis. 



2.1 Banach and Hilbert Space 

Let X be a linear space over the field C of complex numbers. A norm on X 
is a function x — ► \\x || from X to the field R of reals having the four properties 

0) ||* || > 0, 
(ii) ||z|| = iff x=0, 
(Hi) ||ca; || = | c | \\x ||, c £C 
(iv) ||x+y || < ||x|| + \\y\\. 

With such a norm we can talk about convergence in X: a sequence {x k } in X 
converges to x in X, and x is the limit of the sequence, if the sequence of real 
numbers {\\x k -x ||} converges to zero; if such x exists, then the sequence is con- 
vergent. A sequence {x k } is a Cauchy sequence if 

(Ve>0)(3 integer n) i ,k >n ==> || £,•-£* ||<e . 

Intuitively, the elements in a Cauchy sequence eventually cluster around each 
other, so they are "trying to converge". If every Cauchy sequence in X is conver- 
gent (that is, if every sequence which is trying to converge actually does con- 
verge), then X is complete. A (complex) Banach space is a linear space over C 
which has a norm and which is complete. 

A subset S of a Banach space X is a subspace if 
x ,y£S ==> x+y£S 
and 
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xGS, c£C=» cx£S , 

and it is closed if every sequence in S which converges in X has its limit in S. (If 
the dimension of X is finite, then every subspace is closed, but in general a sub- 
space need not be closed.) 

For the definition of Hilbert space start with X a linear space over C. An 
inner product on X is a function (ar ,y ) ->■ <x ,y > from XXX to C having the 
four properties 

(i) <x ,x > is real and >0, 

(ii) <x,x > = iff x=0, 

(iii) the function y -+ <x ,y > from X to C is linear, 

(iv) <x,y> = <y,x>. 

Such an inner product on X induces a norm, namely, ||x || := <x ,x > l l 2 . With 
respect to this norm X may or may not be complete. A (complex) Hilbert space 
is a linear space over C which has an inner product and which is complete. 

Two vectors x , y in a Hilbert space X are orthogonal if <x ,y >=0. If S is 
a subset of X, then S- 1 - denotes the set of all vectors in X which are orthogonal to 
every vector in S; S 1 is a closed subspace for any set S. If S is a closed subspace, 
then S-ms called its orthogonal complement, and we have 

X = S 1 © s . 

This means that every vector in X can be written uniquely as the sum of a vector 
in S-L and a vector in S. 

We shall see in the next two sections several examples of infinite-dimensional 
Banach and Hilbert spaces, but first let's recall the familiar finite-dimensional 
examples of each. 

The space C" is a Hilbert space under the inner product 

<x,y > = x* y . 

Here x and y are column vectors and * denotes complex-conjugate transpose. 
The corresponding norm is \\x || = (x * x ) 1/ ' 2 . 

The space C" Xm consists of all n X m complex matrices. There are several 
possible norms for C" Xm ; for compatibility with our norm on C" we shall take 
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the following. The singular values of A in C" Xm are the square roots of the 
eigenvalues of the Hermitian matrix A* A . We define \\A || to be the largest 
singular value. 

Exercise 1. Prove that 

\\A || =max{||Az|| : ||x|| = 1} . 



2.2 Time-Domain Spaces 

Consider a signal x (t ) defined for all time, -oo<£ <oo, and taking values in 
C" . Thus £ is a function 

(-00,00) -v C" . 

Restrict x to be square-(Lebesgue) integrable: 

00 

/ ||i (OH 2 * <oo. (1) 

-00 

The norm in (l) is our previously defined norm onC". The set of all such sig- 
nals is the Lebesgue space L 2 (-oo,oo). (To simplify notation we suppress the 
dependence of this space on the integer n .) This space is a Hilbert space under 
the inner product 

00 

<x,y> := / x [t)* y(t)dt . 

-00 

Then the norm of x , denoted ||x || 2 , equals the square root of the left-hand side 
of(l). 

The set of all signals in L 2 (-oo,oo) which equal zero for almost all t <0 is a 
closed subspace, denoted L 2 [0,oo). Its orthogonal complement (zero for almost all 
t >0) is denoted L 2 (-oo,0]. 
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2.3 Frequency-Domain Spaces 

Consider a function x(joS) which is defined for all frequencies, -oo<w<oo, 
takes values in C, and is square-(Lebesgue) integrable with respect to w. The 
space of all such functions is denoted L 2 and is a Hilbert space under the inner 
product 

00 

<x,y > := (27I-)- 1 / i(iw)' y(jj)du . 

-00 

The norm on L 2 will be denoted \\x || 2 . The space RL 2 , the real-rational func- 
tions in L 2 , consists of n -vectors each component of which is real-rational, 
strictly proper, and without poles on the imaginary axis. 

Next, H 2 is the space of all functions x (s ) which are analytic in Re s > 0, 
take values in C " , and satisfy the uniform square-integrability condition 

"I 1 / 2 



sup(27r)- 1 / \\x{$+ju)\\*dw 

£^ — nn 



< OO . 



(We have used the same norm symbol for L 2 (-oo,oo), L 2 , and H 2 . Context deter- 
mines which is intended.) This makes H 2 a Banach space. Functions in H 2 are 
not defined a priori on the imaginary axis, but we can get there in the limit. 

Theorem 1. If x £H 2 , then for almost all w the limit 
~x (juj) := lim x (£+i w) 

exists and ~x belongs to L 2 . Moreover, the mapping x — >• ~x from H 2 to L 2 is 
linear, injective, and norm-preserving. 

It is customary to identify x in H 2 and its boundary function ~x in L 2 . So 
henceforth we drop the tilde and regard H 2 as a closed subspace of the Hilbert 
space L 2 . The space RH 2 consists of real-rational n -vectors which are stable and 
strictly proper. 

The orthogonal complement H/ of H 2 in L 2 is the space of functions x [s ) 
with the following properties: x (s ) is analytic in Re s <0; x (s ) takes values in 
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C" ; the supremum 

00 

sup f \\x (£+ 7'w)|| 2 <f w 

«<° -oo 

is finite. Again, we identify functions in H^- and their boundary functions in L 2 . 

Now we turn to two Banach spaces. First, an n X m complex-valued matrix 
F{ju)) belongs to the Lebesgue space L^ iff ||F(yw)|| is essentially bounded 
(bounded except possibly on a set of measure zero). The norm just used for 
F(juj) is the norm on C reXm introduced in Section 2.1 (largest singular value). 
Then the Loo-norm of F is defined to be 

||F||oo:=esssup ||^(j'w)|| . 

This makes L^ a Banach space. It is easily checked that F GRL^ iff F is real- 
rational, proper, and without poles on the imaginary axis. 

The final space is H^,. It consists of functions F (s ) which are analytic in 
Re s >0, take values in C" Xm , and are bounded in Re s >0 in the sense that 

sup{||f (s)|| :Re s >0} < oo . 

The left-hand side defines the H^-norm of F . There is an analog of Theorem 1 
in which H 2 and L 2 are replaced by H^ and L^ respectively: each function in 
Hqq has a unique boundary function in L^, and the mapping from H^-function 
to boundary L^-function is linear, injective, and norm-preserving. So henceforth 
we regard H^ as a closed subspace of the Banach space L^. Finally, RH^ con- 
sists of those real-rational matrices which are stable and proper. 

Let's recap in the real- rational case: 

RL 2 : vector-valued, strictly proper, no poles on imaginary axis 

RH 2 : vector-valued, strictly proper, stable 

RH^ : vector- valued, strictly proper, no poles in Re s <0 

RL^: matrix- valued, proper, no poles on imaginary axis 

RH^: matrix-valued, proper, stable. 
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Exercise 1. In the scalar-valued case prove that RL 2 equals the set of all real- 
rational functions which are strictly proper and have no poles on the imaginary 
axis. 



2.4 Connections 

This section contains statements of two basic theorems relating the spaces 
just introduced. The first, a combined Plancherel and Paley-Wiener theorem, 
connects the time-domain Hilbert spaces and the frequency-domain Hilbert 
spaces. A mapping from one Hilbert space to another is a Hilbert space isomor- 
phism if it is a linear surjection which preserves inner products. (Such a mapping 
is continuous, preserves norms, is injective, and has a continuous inverse.) 

Theorem 1. The Fourier transform is a Hilbert space isomorphism from 
L 2 (-oo,oo) onto L 2 . It maps L 2 [0,oo) onto H 2 and L 2 (-oo,0] onto H 2 . 

This important theorem says in particular that H 2 is just the set of Laplace 
transforms of signals in L 2 [0,oo), i.e. of signals on t >0 of finite energy. 

The second theorem connects the Hilbert space H 2 with the Banach spaces 
Lqq and H TO . For F in L m and X denoting either L 2 or H 2 , let F X denote the 
space {Fx : x £X}. 

Theorem 2. (i) If F eh^, then F L 2 CL 2 and 
\\ F lloo = sup{||Fz || 2 : x GL 2 , ||z || 2 =1} 
= sup{||Fx || 2 : x GH 2 , ||a; || 2 =1} ■ 
(ii) If F GH^, then F H 2 CH 2 and 

11^ lloo = sup{||Fx || 2 : x GH 2 , \\x || 2 =1} . 



s -1 
Exercise 1. Let F (s ) = . Prove that F H is closed in H 

s +1 
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Exercise 2. Let F (s ) = . Prove that F EL is not closed in BL. 

K ' s+1 2 2 



Notes and References 

For Banach and Hilbert spaces the reader is referred to any good book on 
functional analysis, for instance Conway (1985). Standard references for Hardy 
spaces are Duren (1970), Garnett (1981), Hoffman (1962), and Rudin (1966). 
These references deal mainly with scalar-valued functions. For vector- and 
operator-valued functions, see Sz.-Nagy and Foias (1970) and Rosenblum and 
Rovnyak (1985). Theorem 3.1 can be derived from the results in Chapter 11 in 
Duren (1970). Theorem 4.1 is in Paley and Wiener (1934). For an alternative 
proof see Dym and McKean (1972). A complete proof of Theorem 4.2 is not 
readily available in the literature; a starting point could be Problem 64 in Halmos 
(1982), Theorem II.6.7 in Desoer and Vidyasagar (1975), or Theorem II.1.5 in 
Conway (1985). 



CHAPTER 3 



THE STANDARD PROBLEM 



The standard set-up is shown in Figure 1. In this figure w , u , z , and y are 
vector-valued signals: w is the exogenous input, typically consisting of command 
signals, disturbances, and sensor noises; w is the control signal; z is the output to 
be controlled, its components typically being tracking errors, filtered actuator sig- 
nals, etc.; and y is the measured output. The transfer matrices G and K are, 
by assumption, real-rational and proper: G represents a generalized plant, the 
fixed part of the system, and K represents a controller. Partition G as 



G 



G. 



21 ^22 

Then Figure 1 stands for the algebraic equations 
z = G n w + G 12 u 

y = G n w 

u = Ky. 



G 22 u 



To define what it means for K to stabilize G , introduce two additional 
inputs, v j and v 2 , as in Figure 2. The equation relating the three inputs w , v lt 
v 2 and the three signals z , u , y is 



/ 


-C 12 





z 




G n 


w 





/ 


-K 


u 


— 


10 


v i 





~ G 22 


I 


.y , 




G 21 / 


v 2 
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Figure 3.1. The standard block diagram 
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Figure 3.2. Diagram for stability definition 
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It simplifies the theory to guarantee that the proper real-rational matrix 

I -G 12 

I -K 

-G 22 / 

has a proper real-rational inverse for every proper real-rational K. A simple 
sufficient condition for this is that G 22 be strictly proper. Accordingly, this will 
be assumed hereafter. Then the nine transfer matrices from w , v v v 2 to z , u , 
y are proper. If they are stable, i.e. they belong to RH^, then we say that K 
stabilizes G . This is the usual notion of internal stability. An equivalent 
definition in terms of state-space models is as follows. Take minimal state-space 
realizations of G and K and in Figure 1 set the input w to zero. Then K stabil- 
izes G if and only if the state vectors of G and K tend to zero from every ini- 
tial condition. 

The standard problem is this: find a real-rational proper K to minimize 
tne H^-norm of the transfer matrix from w to z under the constraint that K 
stabilize G . The transfer matrix from w to z is a linear-fractional transforma- 
tion of K : 

z =[G u + G 12 K{I-G 2S K)- 1 G 21 ]w. 

Following are three examples of the standard problem. 

A Model-Matching Problem 

In Figure 3 the transfer matrix T x represents a "model" which is to be 
matched by the cascade T 2 Q T 3 of three transfer matrices T 2 , T 3 , and Q . 
Here, T { (?' =1-3) are given and the "controller" Q is to be designed. It is 
assumed that T { G RH^ (»' =1-3) and it is required that Q £ RH TO . Thus the 
four blocks in Figure 3 represent stable linear systems. 

For our purposes the model-matching criterion is 
SU P (II 2 \W- w 6 H 2 , ||w || 2 < 1} = minimum. 



18 



C'h. 3 
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Q 



T, 



Figure 3.3. Model-matching 
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Figure 3.4. Tracking 
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Thus the energy of the error z is to be minimized for the worst input w of unit 
energy. In view of Theorem 2.4.2 an equivalent criterion is 

ll r i ~ T 2 Q TjII^ = minimum. 

This model-matching problem can be recast as a standard problem by defining 



G := 



T 3 



K :=-Q, 

so that Figure 3 becomes equivalent to Figure 1. The constraint that K stabilize 
G is then equivalent to the constraint that Q E RH^. 

This version of the model-matching problem is not very important per se ; 
its significance in the context of this course arises from the fact that the standard 
problem can be transformed to the model-matching problem (Chapter 4), which 
is considerably simpler. 

A Tracking Problem 

Figure 4 shows a plant P whose output, v , is to track a reference signal r . 
The plant input, u , is generated by passing r and v through controllers C x and 
C 2 respectively. It is postulated that r is not a known fixed signal, but, as in 
Chapter 1, may be modeled as belonging to the class 

{ r : r = Ww for some w GH 2 , \\w || 2 < 1 }. 

Here P and W are given and C 1 and C 2 are to be designed. These four transfer 
matrices are assumed to be real-rational and proper. 

The tracking error signal is r -v . Let's take the cost function to be 

(lk-HI 2 2 + IMI 2 2 ) 1/2 , (i) 

where p is a positive scalar weighting factor. The reason for including pu in (1) is 
to ensure the existence of an optimal proper controller; for p = "optimal" con- 
trollers tend to be improper. Note that (1) equals the H 2 -norm of 

r -v 1 

' — L pu J' 

Thus the tracking criterion is taken to be 
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sup{||,z || 2 : w GH 2 ,||w || 2 < 1} = minimum. 
The equivalent standard problem is obtained by denning 
K := [C 1 C 2 ] 



y ■= 



G 



G 21 G 



li 



G 2i : — 



in 



22 
G 12 
^22 



-ft] 

- &]• 



A Robust Stabilization Problem 

This example has already been discussed in Chapter 1. The system under 
consideration is shown in Figure 1.2a. Assume P is a strictly proper nominal 
plant and let R be a scalar-valued (radius) function in RH^. Now define a fam- 
ily P of neighbouring plants to consist of all strictly proper real-rational matrices 
P +AP having the same number (in terms of McMillan degree) of poles in 
Re s >0 as has P , where the perturbation AP satisfies the bound 

\\AP(jw)\\ < \R(ju)\ for all0<w<oo . 

For a real-rational proper K the robust stability criterion is that K stabilize all 
plants in P. Stability means internal stability, that the four transfer matrices in 
Figure 1.2a from v lt v 2 to « , y all belong to RH^. 

We saw in Chapter 1 that robust stability is guaranteed by a small gain con- 
dition. 

Lemma 1. A real-rational proper K stabilizes all plants in P iff K stabilizes the 
nominal plant P and 

WRKil-PKY^K 1. 



We can convert to the set-up of the standard problem by defining G so that 
in Figure 1 the transfer matrix from w to z equals RK (I -PK )~ . This is 
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accomplished by 



G 



RI 

1 P 



Then Lemma 1 implies that the following two conditions are equivalent: K 
achieves robust stability for the original system (Figure 1.2a); in Figure 1 K sta- 
bilizes G and makes the transfer matrix from w to z have H^-norm <1. 



Notes and References 

The standard problem as posed in this chapter is based on Doyle (1984). 
For treatments of the tracking example see Vidyasagar (1985b) and Wang and 
Pearson (1984), and for robust stabilization see, for example, Kimura (1984). 
Lemma 1 is due to Doyle and Stein (1981) and Chen and Desoer (1982). 

There are several other examples of the standard problem, for example, the 
weighted sensitivity problem (Zames (1981)) and the mixed sensitivity problem 
(Verma and Jonckheere (1984), Kwakernaak (1985)). 



CHAPTER 4 



STABILITY THEORY 



In this chapter it is shown how the standard problem can be reduced to the 
model-matching problem. The procedure is to parametrize, via a parameter 
matrix Q in RH^, all K 's which stabilize G , 



4.1 Coprime Factorization Over RH^ 

Recall that two polynomials / (s ) and g (s ), with, say, real coefficients, are 
said to be coprime if their greatest common divisor is 1 (equivalently, they have 
no common zeros). It follows from Euclid's algorithm that / and g are coprime 
iff there exist polynomials x (s ) and y (s ) such that 

fx + gy = 1 . (!) 

Such an equation is called a Bezout identity. 

We are going to take the practical route and define two functions / and g 
in RH^ to be coprime (over RH^) if there exist i, y in RH^ such that (l) 
holds. (The more primitive, but equivalent, definition is that / and g are 
coprime if every common divisor of / and g is invertible in RH^,, i.e. 

h, fh~\ gh" 1 e RH^ => A-^RH^.) 

More generally, two matrices F and G in RH m are right-coprime (over 
RHqJ if they have equal number of columns and there exist matrices X and Y 
in RH m such that 



= XF + YG = I . 



[X Y\ 
This is equivalent to saying that the matrix 



is left-invertible in RH r 
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Similarly, two matrices F and G in RH TO are left-coprimt (over RH^) if 
they have equal number of rows and there exist X and Y in RH such that 



[F G 



FX + GY = / ; 



equivalents, [F G ] is right-invertible in RH^. 

Now let G be a proper real-rational matrix. A right-coprime factorization of 
G is a factorization G =NM~ l where A" and M are right-coprime RH^- 
matrices. Similarly, a left-coprime factorization has the form G =M~ N where 
N and M are left-coprime. Of course implicit in these definitions is the require- 
ment that M and M be square and nonsingular. We shall require special 
coprime factorizations, as described in the next lemma. 



Lemma 1. For each proper real-rational matrix G there exist eight RH^- 
matrices satisfying the equations 

G = NM' X = M _1 JV (2) 



X ~Y 

-N M 



M Y 

N X 



= 1 ■ 



(3) 



Equations (2) and (3) together constitute a doubly- coprime factorization of 
G . It should be apparent that N and M are right-coprime and N and M are 
left-coprime; for example, (3) implies 



[X 



proving right-coprimeness. 

It's useful to prove Lemma 1 constructively by deriving explicit formulas for 
the eight matrices. The formulas use state-space realizations, and hence are 
readily amenable to computer implementation. 

We start with a state-space realization of G , 

G{s) = D + C(s-A Y l B (4) 

A , B , C , D real matrices, 
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with {A ,B) stabilizable and (C ,A) detectable. It's convenient to introduce a 
new data structure: let 

[A,B,C,D] 

stand for the transfer matrix 

D + C (s -A )~ 1 B . 

Now introduce state, input, and output vectors x , u , and y respectively so that 
j/=Gu and 

x = Ax + Bu (5a) 

y = Cx + Du . (5b) 

Next, choose a real matrix F such that A F :== A +BF is stable (all eigenvalues 
in Re s <0) and define the vector v := u-Fx and the matrix C F := C +DF . 
Then from (5) we get 

x = Ap x + Bv 

u = Fx + v 

y = C F x + Dv . 
Evidently from these equations the transfer matrix from v to u is 

M{s):=[A F ,B,F,I] M 

and that from v to y is 

N{s):=[A F ,B,C F ,D\. (6b) 

Therefore 

u = Mv , y = Nv 
so that y =NM~ 1 u , i.e. G ^=NM'K 

Similarly, by choosing a real matrix H so that A H :=A +HC is stable and 
defining 

B H := B + #Z> 

M(«):=|A ff ,Jf,CM] (6c) 
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N(s):=[A H ,B H ,C,D}, (6d ) 

we get G =M~ l N . (This can be derived as above by starting with G{s) T 
instead of G {s ).) Thus we've obtained four matrices in RH^ satisfying (2). 
Formulas for the other four matrices to satisfy (3) are as follows: 

X( 8 ):=[A F ,-H,C F ,I] ( 7a ) 

Y(s):=[A F ,-H,F,0] ( 7b ) 

X(s):={A H ,-B H ,F,I] ( 7c ) 

Y{ 8 ):=[A H ,-H,F,0}. ( 7d ) 

The explanation of where these latter four formulas come from is deferred to Sec- 
tion 4. 



Exercise 1. Verify that the matrices in (6) and (7) satisfy (3). 

Example 1. 

As an illustration of the use of these formulas, consider the scalar-valued 
example 





s (s -2) 




minimal realization is 


G {s ) = [A , B , C , D ] 


A = 


1 

2 


, B = 




1 

- 



C =[-li], D = 0. 
Choosing F to place the eigenvalues of A F (arbitrarily) at {-1, -1}, we get 

F = f-1 -4] 



1 
-1 2 



Then 



26 



Ch. 4 



and 



N{8) = [A F ,B,C,0] 
__ s-1 
" (* +1) 2 

M{s)=[A F ,B,F,l] 

— s ( s ~ 2 ) 
~ {' +1) 2 ' 
Similarly, the assignment 



H = 



-5 

9 



yields 



5 -4 

9 -7 

X{s)= [A F ,-H,C,1] 

— s 2 +6s-23 

(s+lf 

Y{ 8 )=[A P ,-H,F,0] 

— - 41s + 1 
~ («+l) 2 ' 

Finally, in this example we have 

N =N, M =M, X =X, Y 



= Y 



4.2 Stability 

This section provides a test for when a proper real-rational K stabilizes G . 
Introduce left- and right-coprime factorizations 

G = ATM -1 = lr l N (la) 

K = f/F^ 1 = V^U . ( lb ) 
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Theorem 1. The following are equivalent statements about K : 
(i) K stabilizes G , 



(") 



(iii) 



M 



U 



[0 I\N V 



M N 



U\0 I] V 



-i 



GRH n 



-l 



GRH n 



The idea underlying the equivalence of (i) and (ii) is simply that the deter- 
minant of the matrix in (ii) is the least common denominator (in RH^) of all the 
transfer functions from w , v l , v 2 to z , u , y ; hence the determinant must be 
invertible for all these transfer functions to belong to RH^, and conversely. 

The proof of Theorem 1 requires a preliminary result. Insert the factoriza- 
tions (1) into Figure 1, split apart the factors, and introduce two new signals £ 
and T) to get Figure 2. 

Lemma 1. The nine transfer matrices in Figure 1 from w , v v v 2 to z , u , y 
belong to RH^ iff the six transfer matrices in Figure 2 from w , v j, v 2 to £, /? 
belong to RH^. 



Proof. (If) This direction follows immediately from the equations 



= N£ + 







m = U r/ + v j , 
which in turn follow from Figure 2. 
(Only if) By right-coprimeness there exist RH^matrices X and Y such that 

XM + YN = I . 
Hence 

£ = XM £ + YN £ . (2) 
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But from Figure 2 



N£ 



z 

y^v 2 



Substitution into (2) gives 



X 








z 




w 







u 


+ Y 


y _ 


+ x 





- Y 


"2 



Hence the three transfer matrices from w , v v v 2 to £ belong to RH m . 

A similar argument works for the remaining three transfer matrices to r\. Q 



Proof of Theorem 1. We shall prove the equivalence of (i) and (ii). First, let's 
see that the matrix displayed in (ii) is indeed nonsingular, i.e. its inverse exists as 
a rational matrix. We have 



M 



[0 I]N 










I 


u 




V 


— 



I 




[0 I 


G 


I 








I 


^21 


(* 22 





I 


K 




M 


I 






V 








M 


K 
I 




V 



(3) 



Now 

r M 
V 

is nonsingular because both M and V are. Also, since G 22 is strictly proper, we 
have that 



I 











I 


K 


^21 


^22 


I 



is nonsingular when evaluated at s =oo: its determinant equals 1 at s =oo. 
Thus both matrices on the right-hand side of (3) are nonsingular. 
The equations corresponding to Figure 2 are 
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Figure 4.2.1. Diagram for stability definition 



JX <^ 



M~ 



N 



U 



6^ 



Figure 4.2.2. With internal signals 
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■ 








- 




- 





u 






w 




I 






£ 






v l 




V 




V 


— 


v 2 



M 
-[0 I]N V 

Thus by Lemma 1 K stabilizes G iff 

-l 

U 



M 
[0 I]N V 
But this is equivalent to (ii). D 



£ RHoo . 



Exercise 1. Prove equivalence of (i) and (iii) in Theorem 1. 



4.3 Stabilizability 

Let's say that G is stabilizable if there exists a (proper real-rat ioiuil) K 
which stabilizes it. Not every G is stabilizable; an obvious non-stabilizable G is 
<2i2~0> ( -'2i =:= 0, G l 22 == 0, G n unstable. In this example, the unstable part of G 
is disconnected from u and y . In terms of a state-space model G is stabilizable 
iff its unstable modes are controllable from u (stabilizability) and observable 
from y (detectability). The next result is a stabilizability test in terms of left- 
and right-coprime factorizations 

G = NM~ l = M~ l ~N . 



Theorem 1. The following conditions are equivalent: 
(i) G is stabilizable, 

(ii) M , [0 I]N are right-coprime and 




M, 



(iii) M, N 



are left-coprime, 



are left-coprime and 



M , [0 /] are right-coprime. 
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The proof requires some preliminaries. The reader will recall the following 
fact. For each real matrix F there exist real matrices G and H such that 



G 



I 




H. 



The matrices G and H may be obtained by elementary row and column opera- 
tions, and the size of the identity matrix equals the rank of F . The following 
analogous result for RH^-matrices is stated without proof. 

Lemma 1. For each matrix F in RH^ there exist matrices G , H , and F 1 in 
RH^ satisfying the equation 



F = G 



F 1 




H 



and having the properties that G and H are invertible in RH^ and F x is non- 
singular. 



This result is now used to prove the following useful fact that if M and N 



are right-coprime, then the matrix 
which is invertible in RH^. 



M 
N 



can be filled out to yield a square matrix 



Lemma 2. Let M and TV be RH^-matrices with equal number of columns. 
Then M and N are right-coprime iff there exist matrices U and V in RH^ such 
that 

M uX l 



Proof. (If) Define 



X Y 

? ? 



M U 
N V 



where a question mark denotes an irrelevant block. Then 
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[X Y] 



M 

N 



I , 



so M and N are right-coprimc. 
(Only if) Define 



F := 



M 

N 



and bring in matrices G , H , and F x as per Lemma 1. Since i" 1 is left-invertible 
in RHqq (by right-coprimeness), it follows that 

Fi 


is left-invertible in RH^ too. But then it must have the form 

>i 


with F f 1 G RH^. Defining 



K := G 



F X H 
I 



we get 



M 
N 



= K 



Thus the definition 



:- K 



gives the desired result, that 



M U 

N V 



= K 
is invertible in RH m . U 
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The obvious dual of Lemma 2 is that M and N are left-coprime iff there 
exist U and V such that 

M N ~ l 

U V ^ R^oo . 



Proof of Theorem 1. We shall prove equivalence of (i) and (ii). 

(i) => (ii): If G is stabilizable, then by Theorem 2.1 there exist U and V in 
RHqq such that 



M 



U 



€RH n 



[0 I]N V 
This implies by Lemma 2 and its dual that 

M, [0 7]7V are right-coprime 
and 



M, 



(/ are left-coprime. 



But the latter condition implies left- cop rimeness of 



M . 



(ii) => (i): Choose, by right-coprimeness and Lemma 2, matrices X and Y" in 
RH m such that 



M X 

[0 I]N Y 



G RH m 



Also, choose, by left-coprimeness, matrices ii! and T in RH^ such that 



M 





I 






R 
T 



Now define 

U := rz 



(1) 



(2a) 
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V := Y - [0 7]MOT . 
Then we have from (1) and (2) that 

M X 

[0 I]N Y 



(2b) 





7 -RX 




7 











7 


U 



M 
[0 71 AT 7 



(3) 



The two matrices on the left in (3) have inverses in RH^, hence so does the 
matrix on the right in (3). 

The next step is to show that V is nonsingular. We have 



M 
[0 I]N 










7 


u 




V 


— 



7 
[0 JIG 



_ - 










7 


u 




M 


V 




J 



7 


" 




M 





7 U 




7 


G 21 


Gn V 





(4) 



Evaluate all the matrices in (4) at s =oo; then take determinants of both sides 
noting that G 22 is strictly proper and the matrix on the left-hand side of (4) is 
invertible in RH M . This gives 

O^detF(oo) detM(oo) . 

Thus det V (00)7^0, i.e. V~ l exists. Hence we can define K :=UV~ 1 . 

Next, note that U and V are right-coprime (this follows from invertibility 
in RH^ of the matrix on the right-hand side of (3)). We conclude from Theorem 
2.1 that K stabilizes G . □ 

Exercise 1. Prove equivalence of (i) and (iii) in Theorem 1. 



Hereafter, G will be assumed to be stabilizable. Intuitively, this implies 
that G and G 22 share the same unstable poles (counting multiplicities), so to 
stabilize G it is enough to stabilize G 22 . Let's define the latter concept 
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explicitly: K stabilizes G 22 if in Figure 2.1 the four transfer matrices from v l 
and v 2 to u and y belong to RH. 



Theorem 2. K stabilizes G iff K stabilizes G „. 



22* 



The necessity part of the theorem follows from the definitions. To prove 
sufficiency we need a result analogous to Lemma 2.1. 

Lemma 3. The four transfer matrices in Figure 2.1 from v x , v 2 to u , y belong 
to RH oo iff tne f° ur transfer matrices in Figure 2.2 from v v v 2 to f, r] belong to 

The proof is omitted, it being entirely analogous to that of Lemma 2.1. 



Proof of Theorem 2. Suppose K stabilizes G 22 . To prove that K stabilizes 
G it suffices to show, by Lemma 2.1, that the six transfer matrices in Figure 2.2 
from w , v v v 2 to f, ij belong to RH^. But by Lemma 3 we know that those 
from v x , v 2 to f, r/ do. So it remains to show that the two from w to £, r] 
belong to RH^,. 

Set t; x =0 and v 2 =Q in Figure 2.2 and write the corresponding equations: 



M£: 



w + 



Uri 



Vr]={0 I]N£. 
By left-coprimeness there exist matrices R and T in RH m such that 





M 







R 
T 


= I . 




Post-multiply (7) by 


I 



w to get 




MR 


I 



w + 



I 


T 


I 



w = 


I 




(5) 
(6) 

(7) 



(8) 
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Now subtract (8) from (5), rearrange, and define 



to get 



M£i 



{Vi+Uv). 



Also, rearrange (6) and define 

v 2 := [0 7]ATi? 



w 



to get 



(9) 



(10) 



(11) 



(12) 



Vri=[0 I}N£ 1 + v 2 . (13) 

The block diagram corresponding to (11) and (13) is Figure 1. By Lemma 3 and 
the fact that K stabilizes G 22 we know that the transfer matrices in Figure 1 
from «!, v 2 to fj, r; belong to RH TO . But by (10) and (12) those from w to v v 
v 2 belong to RH M . Hence those from w to £ x , rj belong to RH^. Finally, we 
conclude from (9) that the transfer matrix from u> to £ belongs to RH^. U 

Exercise 2. Suppose G n =G U =G 2 \=G 22 . Prove that G is stabilizable. 



4.4 Parametrization 

This section contains a parametrization of all K 's which stabilize G 22 . To 
simplify notation slightly, in this section the subscripts 22 on G 22 are dropped. 
The relevant block diagram is Figure 1. 

Bring in a doubly-coprime factorization of G , 



G 



= NM 

X -Y 

-N M 



-l 



M~ L N 



M Y 

N X 



= I 



(1) 



C'li. I 



» 


AC 1 


*1 

s 


f 








V 






'/ 



A' 



Or 



Figure 4.3.1. For proof of Theorem 4.3.2 



K>JL 



a 



K 



6 



Figure AAA. Diagram for controller parametrization 
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and coprime factorizations (not necessarily doubly-coprime) of A , 

K = UV~ l = V _1 17 . 
The first result is analogous to Theorem 2.1; the proof is omitted. 

Lemma 1. The following are equivalent statements about K : 



K stabilizes G , 



(») 

(iii) 



M U 


-i 


N V 




V -U 


-N h 


f _ 



€ RHoo > 

-1 



The main residt of this chapter is the following. 

Theorem 1. The set of all (proper real-rational) A"s stabilizing G is 
parametrized by the formulas 

K = {Y~MQ){X~NQY l (2) 

= {X -Q~N)-\Y-QM) (3) 

Q e RH M . 



Proof. Let's first prove equality (3). Let Q eRH^,. From (1) we have 

I q] \x -y] \m rl 1 ~Q 

1 -N M NX 07 ^ 1 

so that 



X-QN ~{Y~QM) 

-N M 



M Y-MQ 
N X-NQ 



I . 



(4) 



Equating the (l,2)-blocks on each side in (4) gives 
(X-QN){Y~MQ ) = {Y-QM){X-NQ ) 
which is equivalent to (3). 
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Next, we show that if K is give D by (2), it stabilizes G . Define 
U := Y MQ , V := X --NQ 
U := y-QM , V :=X-QN 
to get from (4) that 



V -V 

-N M 



M U 

N V 



= I . 



(5) 



It follows from (5) that U, V are right-coprime and U, V are left-coprime 
(Lemma 3.2). Also from (5) 
.1-1 

£RH r 



M U 

N V 



*-oo ■ 



So from Lemma 1 A' stabilizes G . 

Finally, suppose K stabilizes G . We must show K satisfies (2) for some Q 
m K-Hoo- Let K = UV~ 1 be a right-coprime factorization. From (l) and defining 
D := MV-NU we have 



X -Y 

-N M 



M I) 
N V 



I XV YV 
D 



(6) 



The two matrices on the left in (6) have inverses in RH^, the second by Lemma 
1. Hence D'eRH^. Define 

Q :=^(XU-YV)D~ 1 , 
so that (6) becomes 

X -Y 

-N M _ 

Pre-multiply (7) by 



M Y 
N X 



M V 
N V 



1 -QD 
D 



(7) 



and use (l) to get 



M U 




M Y 


I 


~QD 


N V 

L 




N X 





D 
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Therefore 



{Y~MQ)D 
{X-NQ)D 



Substitute this into K =■ UV' 1 to get (2). U 



As a special case suppose G is already stable, i.e. G GRH^. Then in (1) we 
may take 

N = N = G 

X = M = I , X =M =1 

Y = 0, Y =0 , 

in which case the formulas in the theorem become simply 

K =-Q(I-GQ)- 1 

= -{I-QG)~ l Q . 

There is an interpretation of Q in this case: -Q equals the transfer matrix from 
v 2 to u in Figure 1 (check this). 

We can now explain the idea behind the choice (1.7) of X, Y , X , Y in Sec- 
tion 1. Recall that the state-space equations for G were 

x = Ax + Bu 
y = Cx + Du , 
that 



:= A + BF , 



A + EC 



were stable, and that we defined 

B H :■= B + ED , C p := C + DF . 

Let's find a stabilizing K by observer theory. The familiar state-space equa- 
tions for K are 

i = Ax + Bu + E(Cx + Du - y) 

u = Fx , 
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or equivalently 

x = Ax + By 

u = Cx , 
where 

A := A + BF + EC + ffl)F = A F + #C> 

B :=-H 

6 := F . 
Thus in terms of our data structure 

K {s ) = [A , B , C , 0] . 
By observer theory K stabilizes G . 

Now find coprime factorizations of K in the same way as we found coprime 
factorizations of G in Section 1. To get a right-coprime factorization 
K = YX~* we first choose F so that A F := A +BF is stable. It is convenient 
to take F := C F , so that A F = A F . By analogy with (1.6) we get K = YX~\ 
where 

I(»):=[A„fl,F,/] 

= [A F ,-H, C F ,I] 
Y(s):=[A F ,B, 6,0] 

= \A F ,~H,F,0} . 

A similar derivation leads to a left-coprime factorization K = X~ l Y , where 
X{s):=[A H ,-B B ,F,I] 
Y{s):=[A H ,-H,F,0}. 
These formulas coincide with (1.7). 
By Lemma 1 we know that 



M Y 

N X 



-l 



GRH 
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X ~Y 

-N M 

Hence the product 



X -Y 

-N M 



grh 



M Y 

N X 



must be invertible in RH^. The only surprise is that the product equals the 
identity matrix, as is verified by algebraic manipulation. 



Exercise 1. In Figure 4 suppose G [s ) 



1 



s(s-l) 



. Consider a controller of the 



form 



K 



-Q 



l~GQ 

where Q is real-rational. Find necessary and sufficient conditions on Q in order 
that K stabilize G . 



4.5 Closed-Loop Transfer Matrices 

Now we return to the standard set-up of Figure 1, Chapter 3. Theorem 4.1 
gives every stabilizing K as a, transformation of a free parameter Q in RH^. 
The objective in this section is to find the transfer matrix from w to z in terms 
of Q. 

In the previous section we dropped the subscripts on G' 22 ; now we must 
restore them. Bring in a doubly-coprime factorization of G 22 : 



G 22 = N 2 M 2 1 



~M 2 X N 2 



^2 ~Y 2 

~~N 2 M 2 



M 2 Y 2 

N 2 X 2 



= I . 



Then the formula for K is 

K =(y 2 -M 2 Q)(X 2 -JV 2 Q)- 1 



(1) 



(2a) 
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Now define 

T i '•= G u + G 12^2 Y 2 G 21 



T 2 := G 12 M 2 



T 3 :=M 2 G 2l 



(2b) 

(3a) 
(3b) 
(3c) 



Theorem 1. The matrices T { (i =1-3) belong to RH^. With K given by (2) 
the transfer matrix from w to z equals T j-T 2 QT 3 . 

Proof. The first statement follows from the realizations to be given below. For 
the second statement we have 

z =[G n + G l2 {I-KG 22 y l KG 2] \w . (4) 

Substitute G 22 = N 2 M 2 l and (2b) into [I -KG 22 )~ l and use (1) to get 

(I-KG 22 Y 1 = M 2 (X S -QN 2 ). 
Thus from (2b) again 

{I-KG 22 y l K =M 2 {Y 2 -QM 2 ). 
Substitute this into (4) and use the definitions of T,- to get 

z =(T 1 -T 2 QT 3 )w . D 



For computations it is useful to have explicit realizations of the transfer 
matrices T, {i =1-3). Start with a minimal realization of G : 

G {s ) = [A , B , C , D ] . 

Since the input and output of G are partitioned as 



the matrices B , C , and D have corresponding partitions: 
B = \B X B 2 \ 
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C = 



D 



A" 




C 2. 




'D n D u 


Pti 


D 2 2_ 



Then 

G lj {s)=[A,B J ,C i ,D t] ], i,j =1,2. 

Note that D 22 =0 because G 22 is strictly proper. It can be proved that stabiliza- 
bility of G (an assumption from Section 3) implies that (A ,B 2 ) is stabilizable 
and (C 2 ,A ) is detectable. 

Next, find a doubly-coprime factorization of G 22 as developed in Section 1. 
For this choose F and H so that 

A F := A + B 2 F , vl 7/ := A + EC 2 

are stable. Then the forum his are as follows: 

= {A P ,B 2 ,F,I] 

= [A F ,5 2 , C 2) 0] 

■■[A H ,H, C 2 ,I] 

[A H ,B 2 , C 2 ,0] 

\A F ,-H, C 2 ,I] 

[A F ,-H,F,0] 

\A H ,~B 2 ,F,I] 

[A B ,-H,F,Q\. 

Finally, substitution into (3) yields the following realizations: 
T 1 {s)=[A,B,C,D u ] 

~A F -B 2 F' 
A, 



M 2 (s) 


N 2 (s) 


M 2 (s) 


N 2 (s) 


* 2 («) 


Y2(*) 


X 2 (s) 


~Y 2 (s) 



B 



H 
B 1 +HD 21 
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C =[C l+ D l2 F -D 12 F] 

T 2 {s)={A F ,B 2 ,C 1+ D 12 F,D 12 } 

T 3 {8)=[A E ,B 1 +HD 2V C 2 ,D 21 ]. 

It can be observed that T, GRH^ (t =1-3), as claimed in Theorem 1. For exam- 
ple, this is how the realization of T 2 is obtained: 

T 2 (s)=G n (s)M 2 (s) 

= \A,B 2 , C V D 12 }X[A F ,B 2 ,F,1] 





'A B 2 F 




PH 


■ 


A F 


J 


B K 




'a 









A F 


J 


B 2 \' 



[Ci D l2 F],D l2 



(5) 



[^ , B 2 , C 1+J D 12 F,Z> 12 ] 



Similarity transformation by 



/ / 
I 



was used in (5). 



Example 1. 

Consider the tracking example of Chapter 3 with 
s-1 



P{8) = 

W(s) = 



s (s -2) 

10s +1 
and p=l. We have 



G 



12 



Gn G 



G 21 G 22 



S+l 



Gn(fi) 



10s +1 




, G 12 (fi) 



G 21 (fi)=G n ( s ), G 22 ( s 



s-1 

s (s ^2) 
1 





s-1 



s (s -2) 
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A minimal realization of G is 
G{s)=\A,B ,C , D 



A = 



.1 











2 








1 






, Jj — 



1 


o" 





1 









09 


-1 


1 











09 











1 


-1 



c = 



For F and H we may take 
F = [0 3 -1] 

H 

Then 



A F 



, D = 



.1 








1 


.1 





















-9 





-5 J 



-.1 





" 





-1 


-1 





1 






- A H 



A = 



.10 

-1 -1 

1 















o" 





3 


1 











.1 











-7 


9 





-4 


5 



c = 



.09 




Finally, 



T l (s) = 



-1 1 
-3-1031 



s+1 



105+1 





T 2 (* 



s'+s +1 
s 2 +s+l 



.1 








-7 9 





-4 5 



B 
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T 3 {s) = T^s). 

Exercise 1. It's desired to find a K which stabilizes G and makes the dc gain 
from w to z equal to zero (asymptotic rejection of steps). Give an example of a 
stabilizable G for which no such K exists. Find necessary and sufficient condi- 
tions (in terms of 1\ , t =1-3) for such K to exist. 

The results of this chapter can be summarized as follows. The matrix G is 
assumed to be proper, with G 22 strictly proper. Also, G is assumed to be stabil- 
izable. The formula (2) parametrizes all K 's which stabilize G . In terms of the 
parameter Q the transfer matrix from it' to z equals T X ~T 2 QT 3 . Such a func- 
tion of Q is called affine. 

In view of these results the standard problem can be solved as follows: First, 
find a Q in RH^ to minimize \\T X -T 2 QT ^W^, i.e. solve the model-matching 
problem of Chapter 3. Then obtain a controller K by substituting Q into (2). 



Notes and References 

The material of this chapter is based on Doyle (1984). Earlier relevant refer- 
ences are Chang and Pearson (1978) and Pernebo (1981); a more general treat- 
ment is given in Nett (1985). 

As a general reference for the material of this chapter see Vidyasagar 
(1985a). The idea of doing coprime factorization over RH LXj is due to Vidyasagar 
(1972), but the idea was first fully exploited by Desoer el al. (1980). State-space 
formulas for coprime factorizations were first developed by Khargonekar and Son- 
tag (1982). The state-space formulas in Section 1 are from Nett et al. (1984). 
The important parametrization of Theorem 4.1 is due to Youla et al. (1976) as 
modified by Desoer et al. (1980). Finally, see Minto (1985) for a comprehensive 
treatment of stability theory by state-space methods. 



CHAPTER 5 



BACKGROUND MATHEMATICS: OPERATORS 



The purposes of this chapter are to introduce some basic definitions about 
operators on Hilbert space and to study in some detail a certain type of operator, 
namely, a Hankel operator. 



5.1 Hankel Operators 

Let X and Y be Hilbert spaces and let $ be a linear function from X to Y. 
The function $ is bounded if there exists a real number a such that 

11**11 < « Ik || , iGX. 

The least such a is called the norm of $, denoted ||$||. The following equations 
are not hard to derive: 

||$|| = sup {||*x || : ||x || < 1} 

= sup {||$z || : ||x || = 1}. 

Such a bounded linear function is called an operator. 

Example 1. 

The Fourier transform, denoted T, is an operator from L 2 (-oo,oo) to L 2 . 
Theorem 2.4.1 says that its norm equals 1. 

Example 2. 

Recall from Section 2.3 the direct sum 

L 2 (-oo,oo) = L 2 (-oo,0] © L 2 [0,oo) . 

Each function / in L 2 (-oo,oo) has a unique decomposition / = / l -\- f 2 with 
/ 1 GL 2 (-oo,0] and / 2 £L 2 [0,oo): 
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/i(0= / (0. f 2 (t) = o, t<o 
/i(0 = o, f 2 (t)=f(t), t>o. 

The function / —>■ f L from L 2 (-oo,oo) to L 2 (-oo,0] is an operator, the orthogo- 
nal projection of L 2 (-oo,oo) onto L 2 (-oo,0j. It's easy to prove that its norm 
equals 1. 

In the same way we have 

JL 2 z==z "2 ^ 2 ■ 

The orthogonal projection from L 2 onto H 2 ^will be denoted ITj and from L 2 onto 
H 2 by n 2 . 

Example 3. 

Let F GL^ and define the function A F from L 2 to L 2 via 

& F 9 := Fg . 

Thus the action of A F is multiplication by F . Obviously A F is linear. Theorem 
2.4.2 says that ||A^. || = \\F H^,, so A F is bounded. This operator is called a 
Laurent operator and F is called its symbol; so A F is the Laurent operator with 
symbol F . 

A related operator is A F | H 2 , the restriction of Ap to H 2 , which maps H 2 
to L 2 . Theorem 2.4.2 says that its norm also equals \\F H^. 

Observe that if F (EH^ then, also by Theorem 2.4.2, 

A F H 2 C H 2 . 

The converse is true too: if A F H 2 CH 2 , then F GH^. (The rational version of 
this result will be proved in Lemma 8.3.1.) 

Example 4. 

This is the time-domain analog of the previous example. Recall that convo- 
lution in the time-domain corresponds to multiplication in the frequency-domain. 
Suppose F (s ) is a matrix-valued function which is analytic in a vertical strip 
containing the imaginary axis and which belongs to L M . Taking the region of 



r>o 



t 'h. r, 



L 2 (-oo,oo) ZL ». L„(-oo,oo) 



T 



L, 



T 



Figure 5.1.1. Example 4 
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convergence to be this strip, let / (t ) denote the inverse bilateral Laplace 
transform of F (s ). Now define the convolution operator 5 t from L 2 (-oo,oo) to 
L 2 (-oo,oo) via 

y — - By u 

oo 

V{*)=- I f {t-r)u{r)dT. 

-00 

This system is linear, but not necessarily causal because / (t ) may not equal zero 
for negative time. Note that the system is causal iff S, maps L 2 [0,oo) into 
L 2 [0,oo), i.e. "5/ leaves the future invariant". The operators 3, and A F are 
intimately related via the Fourier transform. The relationship is exhibited in the 
commutative diagram of Figure 1. 

Example 5. 

Again let F GL^,. The Toeplitz operator with symbol F , denoted Q F , maps 
H 2 to H 2 and is defined as follows: for each g in H 2 , & F g equals the orthogonal 
projection of F g onto H 2 . Thus 

& F = Il 2 Ap I H 2 . 

The relevant commutative diagram is Figure 2. As a concrete example consider 
the scalar-valued function F (s ) = l/(s -l) in RL^. For g in H 2 we have 

Fg = g i + g 2 
g^B.2 1 , g 2 eH 2 

0i(O = 0(l)/(*-l) 
ff 2 (*)== Iff (O-P (!)]/(« -1). 
Thus ®p maps g to g 2 . 
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Figure 5.1.2. Example 5 




Figure 5.1.3. Example 6 
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Example 6. 

For F in L^ the Hankel operator with symbol F, denoted F F , maps H 2 to 
H/ and is denned as 

I> := U^p I H 2 . 

The corresponding commutative diagram is Figure 3. For the example 
F(s)= l/(s -1), F F maps fir (s ) in H 2 to g (l)/(s -1) in H., 1 . Note that T F =0 if 

The relationship between the three operators A F , @ F , and T F can be 
described as follows. We have 

A^rHg 1 © Ha^H/© H 2 , 

and correspondingly we can regard A F as a 2X2 matrix with operator entries: 



A/? = 


A 21 


A 12 

A 22 _ 


>r example 


A u = n x A f | H + 


follows from the definiti 


A F | H 2 


= 


&F 



l 12" 



"-22" 



Example 7. 

In this example we study the Hankel operator with the special symbol 

F{ s ) = [A,B,C,0], 

where A is antistable (all eigenvalues in Re s >0). Suppose A isnXn. Such 
F belongs to RL^. The inverse bilateral Laplace transform of F {s ) is 

/ {t) = ~Ce At B , t<0 

f (t) = 0, t>0 . 

The time-domain analog of the Hankel operator, denoted T , , maps a function « 
in L 2 [0,oo) to the function y in L 2 (-oo,0] defined by 
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OO 



»(«)=// {t-T)u{r)dT, t<0 





oo 



= -Ct M J t- AT Bu{T)dr, t<0. (1) 

o 

Define two auxiliary operators: the controllability operator 

* c :L 2 [0,oo) -+ C n 
oo 

o 
and the observability operator 

V :C» - L 2 (-oo,0] 

(¥ a: )(*):= Ce M x , t <0 . 
From (1) we have that 

r, = * # c . 

Exercise 1. Show that * c is surjective if (A ,B ) is controllable and that ty is 
injective if (C ,A ) is observable. 

There is a systemic interpretation of T t in terms of the usual state-space 
equations 

x = Ax + Bu (2) 

V = Cx . (3) 

To see the action of T i , solve these equations in the following way. First, apply 
an input u in L 2 [0,oo) to equation (2) with initial condition x{0)=x and such 
that x [t ) is bounded on [0,oo). Then 

x (t) = e At x + e M j e- AT Bu{r)dT, t >0 



so that 



00 

x = -j e ~ A T Bu (r)d t 
o 
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= * e u . 

Now solve (2) and (3) backwards in time starting at t =0 and noting that 
u (t )=0 for t <0. The solution is 

y{t)= Ce M x Q 

= (*.*o)(0> *<«• 

In this way T f maps future input to initial state to past output. 

We shall need the concept of the adjoint of an operator $ from X to Y, two 
Hilbert spaces: it's the unique operator $' from Y to X satisfying 

<$x,y > = <x,$*y > , x£X, y £Y . 

The operator $ ' $ from X to X is self-adjoint, i.e. it equals its adjoint. The 
norms of $ and $ * $ are related as follows: 

ll*H 2 = ll*'*||. (4) 

The adjoint of a Laurent operator A^ can be obtained explicitly as follows. 
Introduce the notation 

F~(j«):=F(; U )' , (5) 

where * here denotes complex-conjugate transpose. If F GRL^, then we shall 
interpret F~ as 

F~[s):=F{-s) T , 
which is consistent with (5). If g and h belong to L 2 , then 

<A F g ,h > — <g ,Apk > 
and 



<A P g ,h > = (27T)- 1 J g (j w)' F (j w)* h (j u)d 



U! 



= <g,F~h> . 
We conclude that A F equals the Laurent operator with symbol <F ' 
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Similarly, the adjoint of a Hankel operator T F can be characterized as fol- 
lows. Let g £H 2 and h £H 2 A Then 

<g,Tph> = <T F g,h> 

= <H x Fg ,h > 

= <Fg ,h > because Tl 2 Fg ]_h 

= <g,F~h> 

— <g ,U 2 F~h > because g lll 1 i ;, ~/i . 

We conclude that 

r; = n 2 A/ | h/ . 

For example if jF (s ) = (s -l)~\ then T F maps h {s ) in H^ to -h (~l)/(s +1) in 
H 2 . (Verify.) 

Exercise 2. Show that the adjoints of ty c and *$/ are as follows: 
*/ : C» -+L 2 [0,oo) 
(V e 'x){t) = -B T e- ATt x , t>0; 
0? ; : L 2 (-oo,0]-C B 

* V = Je ATt C T y{t)dt . 

-oo 

The ranA; of an operator $ : X— >-Y is the dimension of the closure of its 
image space $X. 

Our interest is in Hankel operators with real-rational symbols, Example 7 
being a special case. 

Theorem 1. If F GRL^, then T F has finite rank. 

Proof. There is a unique factorization (by partial-fraction expansion, for exam- 
ple) F = F i + F 2, where F x is strictly proper and analytic in Re s <0 and F 2 
is proper and analytic in Re s >0, i.e. F jE'RH. 0O - Since Y F =T Fi , we might as 
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well assume at the start that F is strictly proper and analytic in Re s <0. Intro- 
duce a minimal realization: 

F{s)=[A,B,C,0}. 

The operator T F and its time-domain analog have equal ranks. As in Example 7 
the latter operator equals * * c . By controllability and observability * c is sur- 
jective and * is injective. Hence * * c has rank n , so T F does too. □ 

We need another definition. Let $ be an operator from X to X, a Hilbert 
space. A complex number X is an eigenvalue of $ if there is a nonzero x in X 
satisfying 

$x = \x . 

Then x is an eigenvector corresponding to X. In general an operator may not 
have any eigenvalues! 

For the remainder of this section let F GRL^. The self- adjoint operator 
T F F F maps H 2 to itself and its rank is finite by Theorem 1. This property 
guarantees that it does in fact have eigenvalues. We state without proof the fol- 
lowing fact. 

Theorem 2. The eigenvalues of T F F F are real and nonnegative and the largest 
of them equals HF^'r^ ||. 

This theorem together with (4) says that \\T F \\ equals the square root of the 
largest eigenvalue of T F r F . So we could compute ||r F || if we could compute the 
eigenvalues of T F T F . How to do this latter computation is the last topic of this 
section. 

We continue with the notation introduced in Example 7 and the proof of 
Theorem 1. The self-adjoint operators * c */ and * '* map C n to itself. 
Thus they have matrix representations with respect to the standard basis on C n . 
Define the controllability and observability gramians 



oo 

I 



J e- At BB T e- ATt dt (6) 
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oo 

L :=/ e- ATt C T Ce- At dt . (7) 

o 

It is routine to show that L c and L are the unique solutions of the Lyapunov 
equations 

AL C + L C A T = BB T (8) 

A T L +L A = C T C . (9) 

Exercise 3. Prove that the matrix representations of * c ty c and ^ ^ are L c 
and L respectively. 

Theorem 3. The operator F F T F and the matrix L c L share the same nonzero 
eigenvalues. 

Proof. Let X be a nonzero eigenvalue of T F T F . It's easy to show that X is also 
an eigenvalue of the time-domain analog of T F T F , which equals *P C ^ ^ ^c • 
Hence there exists a nonzero u in L 2 [0,oo) satisfying 

*/*>„ * e „ = \ u . (10) 

Pre-multiply (10) by * c and define x := * c u to get 

L e L x = \x . (11) 

If x , i.e. ^ c u , were to equal zero, then so would Xm from (10). This is not pos- 
sible (both X and u are nonzero), so x is an eigenvector of L c L and X is an 
eigenvalue. 

Conversely, let X be a nonzero eigenvalue of L c L and x a corresponding 
eigenvector. Pre-multiply (11) by */i and define u := */£,, x to get (10). 
The function « is nonzero because x is nonzero and ty c and L are injective. 
Therefore X is an eigenvalue of ^ c ** \P„ * c , hence of T F T F . U 

In summary, the norm of T F for F in RL^ can be computed as follows. 
First, find a minimal realization (A ,B ,C) of the antistable part of F (s ), i.e. 
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F (s ) = \A , B , C , 0] + (a matrix in RH ro ) . 

Next, solve the Lyapunov equations (8) and (9) for L c and L . Then \\T F 
equals the square root of the largest eigenvalue of L c L„ . 



5.2 Nehari's Theorem 

In this section we look at the problem of finding the distance from an L^- 
matrix R to H^: 

dist(£ ,HJ := inf {p-XIL: XeEU . 

In systemic terms we want to approximate, in L^-norm, a given unstable transfer 
matrix by a stable one. Nehari's theorem is an elegant solution to this problem. 
A lower bound for the distance is easily obtained. Fix X in H^. Then 

P-*llcc = l|A*-A x || 

>\\Ti^ R -k x )\n 2 \\ 

= l|r K -r x || 
= lir* II ■ 

The last equality is due to the fact that T x =0. Thus ||r_R || is a lower bound for 
the distance from R to H^. In fact it equals the distance. 

Theorem 1. There exists a closest H^-matrix X to a given L^-matrix R , and 

p-x|| = ||r^||. 

In general there are many X 's nearest R . Interpreted in the time-domain 
Theorem 1 states that the distance from a given noncausal system to the nearest 
causal one (the systems being linear and time-invariant) equals the norm of the 
Hankel operator; in other words the norm of the Hankel operator is a measure of 
noncausality. How to find nearest H^-matrices is the subject of Section 8.3. 
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Example 1. 

As an illustration, let's find the distance from 

1 



R(s) 



S 2 -l 



4 

1 s+1 

s 2 -s+l s ~ 1 



to Hqq. First, find the strictly proper antistable part of R . In this simple case 
partial fraction expansion suffices; a state-space procedure is described in the 
proof of Theorem 7.3.1. We get 

R(s) = R 1 (s)+R 2 (s) 
•5 



Ri(s) 



s-1 

1 



s 2 -s+l s-1 



R 2 eRH O0 . 
Now get a minimal realization: 
R 1 {s)=[A,B,C,0] 



A 



C 



2 


-2 


1 





1 














1 

















1 



B 



1 




















2 



.5 -.5 .5 
1-11 



The solutions of the Lyapunov equations are 



L r = 



.3333 -.1667 

.1667 

-.1667 .3333 

2 



.6250 -1.125 .6250 -.3333 

-1.125 2.625 -1.625 1 

.6250 -1.625 1.125 -.6667 

-.3333 1 -.6667 .5 
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Finally, the distance from R to H^ equals the square root of the largest eigen- 
value of I, L n , 1.2695. 



Notes and References 

An elementary book on operators is Gohberg and Goldberg (1981). Many 
relevant and interesting results can also be found in Halmos (1982) and Rosen- 
blum and Rovnyak (1985). Theorem 1.1, known as Kronecker's lemma, is a basic 
fact used in linear system theory. Theorem 1.2 is a special case of a result on 
compact operators; see for example Theorem 4.4 in Gohberg and Goldberg (1981). 
Example 1.7 and Theorem 1.3 are from Glover (1984). Theorem 2.1 is a generali- 
zation of Nehari's original result (Nehari (1957)); for a (relatively) simple proof 
see Power (1982). 



CHAPTER 6 



MODEL-MATCHING THEORY: PART I 



The model-matching problem is this: given three matrices T t in RH M , find 
a matrix Q in RH^ to minimize || T X ~T 2 QT ^l^. This chapter discusses when 
the problem is solvable and gives a complete solution in the scalar-valued case. 



6.1 Existence of a Solution 

To each Q in RH^ there corresponds a model-matching error, 
|| T j^-T 2 QT 3 \\ 00 . Let a denote the infimal model-matching error: 

c t :=mi{\\T 1 -T t QT z \\ a ;.Q&Si O0 }. (1) 

A matrix Q in RH^ satisfying 

a = ||r 1 ^r 2 Qr 3 || 0o 

will be called optimal. 

This section is concerned with the question of when an optimal Q exists. 
Let's look at a few examples to get a feel for the problem. 

Example 1. 

The trivial case is when the linear equation 

r 1 =r 2 gr 3 (2) 

has a solution in RH^. Such a solution is obviously optimal. It is not a difficult 
problem, but is not germane to this course, to get necessary and sufficient condi- 
tions for solvability of (2). Solvability hardly ever occurs in practice, because it 
means in the standard problem that the exogenous signal w can be completely 
decoupled from the output signal z . 
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The remaining examples are of the special case where T, and Q are scalar- 
valued. Then there's no need for both T 2 and T 3 since T 2 QT 3 =T 2 T 3 Q , so 
we may as well suppose T 3 =1. In addition, we'll need the following definition: 
a function F in RL^ is all-pass if | F (jw) | = constant. 

Example 2. 

Take T 2 to have one zero in Re s >0: 

iy ' s+1 
Then for every Q in RH^ 

lirx-raQIU > |2\(i)-r 2 (i)Q(i)| 

= 1^(1)1 , 
so a> I T^l) | . Defining 

we have Q GRH^ and 

l|r 1 -r 2 g|| 00 = |r,(i)| , 

so Q is optimal. Moreover, T 1 ~T 2 Q is all-pass; in fact it's a constant. Thus in 
this example an optimal Q exists and, furthermore, T X -T 2 Q is all-pass. We'll 
see later that the optimal Q is unique in this example. 

Example 3. 

Take 

1 



T 2 (*) = 



s+1 
So T 2 has a zero at s =oc but no (finite) zeros in Re s >0. For Q in RH C 

IIZVTjQIL^ IT^oo)! , 

so a> I r i (oo) I . As in the previous example, defining 

we have that Q is optimal and T 1 -~T 2 Q is again all-pass. If, however, 
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then Q =0 is also optimal, but T ' t -T 2 Q is not all-pass unless T 1 happens to be. 

Example 4. 

For an example where an optimal Q doesn't exist take 

^ ' 8+1' ^ ' (S+If 

It is claimed that a=Q. To see this define 

es +1 
Then 



(s+l)(es+l) 
so from the Bode plot of this function 

\\T 1 ~T 2 Q e \\ 0O <e. 

Thus ||r l -T 2 Q||oo can be made arbitrarily small by suitable choice of Q in 
RHqq, i.e. a=0. But the only solution of 

is Q (s )=s +1, which doesn't belong to RH^. So an optimal Q doesn't exist. 

The following theorem provides a sufficient condition for an optimal Q to 
exist. 

Theorem 1. An optimal Q exists if the ranks of the two matrices T 2 (j'w) and 
r 3 (ju) are constant for all 0<w<oo. 

The proof of this theorem involves some advanced tools from functional 
analysis and so is omitted. The rank conditions will be assumed to hold for the 
remainder of this chapter. Note that they don't hold in Examples 3 and 4; since 
an optimal Q does exist in Example 3, the conditions are not necessary for 
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existence. 

Let's see what the rank conditions mean for a specific control problem. 



Example 5. 

Consider the tracking problem outlined in Chapter 3. We have 

G 



G 



li 



G 



21 



"Gi 


G 12 








G 2 


^22 




wl 




~P 







) G 12 = 


s 1 


- 


w 




' G 22 = 




p 





To conform with the assumptions of Chapter 4 (G proper, G 22 strictly proper, 
G stabilizable), assume W GRH^ and P is strictly proper. 

Bring in a doubly-coprime factorization of P : 

P = N P Mp- 1 = Mp^Np 



X P -Y P 

-N P M P 



Mp Y P 
N P X P 



= I 



Then a doubly-coprime factorization of G 22 as in (4.5.1) is 



N 9 



N 2 = 





Np 


J 


M 2 = -Mp 





, M 2 = 


I 

Mp 


I 

X F 


, Y 2 = 


[o r P ] 



X 2 — Xp , Y 2 
From (4.5.3) 



T 2 = G 12 M 2 = 



[o r P ; 



pMp 
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T 3 = M 2 G n = 



W 




If p>0, then rank T 2 {j w) is constant for all 0<w<oo; this follows from 
right-coprimeness of N P , M P . If p were equal to zero, then T 2 (oo) would equal 
because P is strictly proper. Thus T 2 satisfies the rank condition iff the con- 
trol energy is weighted in the performance function (see equation (3.1)). 

The matrix T 3 satisfies the rank condition iff rank W [j w) is constant for all 
0<w<oo. This is a type of nonsingularity condition on the reference signal r in 
Figure 4 of Chapter 3. 

Exercise 1. Find 

infdl^-r^lL-.QGRH^} 
for 

ri(.) = ^±L, T t (.)= s 



S+l ^ ' 5+1 



6.2 Solution in the Scalar- Valued Case 

This section contains a complete solution to the model-matching problem 
when the T i 's are scalar-valued. As mentioned in the previous section, we may 
assume r 3 =l. To conform with Theorem 1.1 it is also assumed that T 2 (iw)^0 
for all 0<w<oo. Finally, it is assumed that T % l ^RH^ to avoid the trivial 
instance of the problem. 

We begin with the notions of inner and outer functions. A scalar-valued 
function T in RH^ is inner if T~T=1, i.e. 

T(-s)T{s) = l, 
and outer if it has no zeros in Re 5 >0. Examples of inner functions are 

Is ls+s 2 *' 



1+* 1 + S+S 2 

Inner functions have pole-zero symmetry with respect to the imaginary axis: 
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s =s is a zero iff its mirror image s =~7 is a pole. Observe that the zeros of 
an inner function all lie inside the right half-plane (hence the adjective "inner"). 
Examples of outer functions are 

. s+2 s 



8+1 



The zeros of an outer function all lie outside the right half-plane Re s >0 (hence 
"outer"). In electrical engineering terminology, an inner function is stable and 
all-pass with unit magnitude and an outer function is stable and minimum phase. 

Lemma 1. Every scalar-valued function T in RH^, has a factorization 
T = T { T with T, inner and T outer. If T (j w)^0 for all 0<w<oo, then 
T ~ GRH^. 

Proof. Let T, be the product of all factors of the form 



a +s 

where a ranges over all zeros of T in Re s >0, counting multiplicities, and 
define T := T j T,- . Then T t and T are inner and outer respectively, and 
T = Tj T g . If T is not strictly proper and has no zeros on the imaginary axis, 
then T has these two properties too, so T^GRH^,. □ 

A factorization of the above form is called an inner-outer factorization. (A 
state-space procedure for doing matrix inner-outer factorization will be developed 
in Section 7.4.) 

Exercise 1. Prove that inner and outer factors are unique up to sign, i.e. if 

Ui u„ = v,. v 

{/,- , Vi inner 

U , V outer and nonzero , 
then either 

v t =v t , u = v 



68 Ch - 6 



or 

u { =-v t , u =-V . 

Returning to the model-matching problem, bring in an inner-outer factoriza- 
tion of T 2 : r 2 =r 2l - T 2o . For Q in RH ro we have 

\\T \-T 2 Q | loo = ll^ , l~^2! T 2o Q lloo 

= l|r 2l (r 2 -; 1 r 1 -r 2e Q)|| 00 

= ||r 2 - 1 r 1 -r 20 Q|| 0O (i) 

1 1 r> x^ 1 1 

— ||-« — -A- Moo » 

where 

R := T 2 ?T X (2) 

X := T 2o Q . (3) 

Equality in (1) follows from the property | T 2i (j'w) | =1. Note that R GRL^. 
Also, since T 2o and r^GRH^ (3) sets up a one-to-one correspondence between 
functions Q in RH^ and functions X in RH^. We conclude that 

a = inf{||/?-X|| 0O :XGRH 00 } (4a) 

= dist(# .RH.J . ( 4b ) 

A function X in RH^ satisfying 

a = \\R -X Hqq 
will be called optimal. An optimal X yields an optimal Q via (3). 

The optimization problem in (4) is much like that in Section 5.2, finding the 
distance from R to H^. Since RE^CH^, we have 

dist(fl .RH^) > dist(fl 3^) . ( 5 ) 

It will turn out that the distances are in fact equal because R is real-rational, so 
that, from (4) and Nehari's theorem, a equals ||r# ||. 

For the main results we shall use the machinery of Chapter 5. Factor R as 

R = it j + it 2 
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with R j strictly proper and analytic in Re s <0 and R 2 in RH^. Then R t has 
a minimal state-space realization 

R 1 (s)=[A,B,C,0} (6) 

with A antistable. The controllability and observability grammians are the 
unique solutions of 

AL C + L c A T = BB T (7) 

A T L + L A = C T C . (8) 

Let X 2 equal the largest eigenvalue of L c L and let w be a corresponding eigen- 
vector: 

L c L w = X 2 w . (9) 

Defining v := X _1 L w , we have the pair of equations 

L c v =\w (10) 

LgW = \V . (ll) 

Finally, define the real-rational functions 

/ (s) := [A , w , C , 0] (12) 

g( S ):={~A T ,v,B T ,0}. (13) 

Observe that / eRHg 1 and g £RH 2 . 

Lemma 2. The functions / and g satisfy the equations 

T R g = \f (14) 

r«7 = x<? . (is) 

Proof. To prove (14) start with (7). Add and subtract sL c on the left-hand 
side to get 

-(s -A )L C + L c (s +A T ) = BB T . 

Now pre-multiply by C [s ~A ) -1 and post-multiply by (s +A )~ 1 v to get 

-CL C {s +A T )~ 1 v + C {s -A )- x L t v 
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^Cis-AytBBTis+A 1 )-^ . (16) 

The first function on the left-hand side belongs to H 2 ; from (10) and (12) the 
second function equals \f {s ); and from (6) and (13) the function on the right- 
hand side equals R x (s )g (s ). Project both sides of (16) onto H 2 1 to get 

\f =U 1 R 1 g 

= r Rl g ■ 

But T Ri =T R ; hence (14) holds. 

Equation (15) is proved similarly starting with (8). U 

Vectors / and g satisfying (14) and (15) are said to be a Schmidt pair for 
the operator T^ . 

Exercise 2. Prove that / ~/ — g~g . 
Notice from (14) and (15) that 

?1Tr 9 = ^9 , ( 17 ) 

i.e. g is an eigenvector of T R T R corresponding to X 2 , the largest eigenvalue (by 
Theorem 5.1.3). 

Theorem 1. The infimal model-matching error a equals \\T R ||, the unique 
optimal X equals R-af /g , and, for the optimal Q , T^T 2 Q is all-pass. 

Proof. From Nehari's theorem there exists a function X in H ra such that 

P-*lloo = lir*l|. ( 18 ) 

It is claimed that 

(R-X)g = T R g . U 9 ) 

To prove this, define h := {R -X)g and look at the L 2 -norm of h -T R g : 

\\h-r R g\\i = <h-T R g,h-T R g> 



Ch. 6 71 

= <h,h> + <T R g,T R g > 

-<h,T R g > ~ <T R g,h> . (20) 

Being in H 2 \ T R g is orthogonal to the H 2 -component of h . Thus 
<h,T R g> = <n 1 h,T R g> . 



But 



IIj/i =n i (R~X)g 
= Tl l Rg 
= T R9 • 



Hence 



<h,T R g > = <T R g,T R g > . (21) 

Using (21) in (20) we get 

II* ~ T R g\\?= <h,h>~ <T R g,T R g> 

= <h,h> - <g,T R T R g> 

= <h,h> - \ 2 <g ,g > from (17) 

= \\(R-X)g\\?-\*\\g\\i 



^(p-jrii^-x^iij, 



12 



But ||jR -X|| 0O =X from (18), so h =F R g . This proves the claim. 

From the claim we get the first two assertions in the theorem statement as 
follows. Every X in H^ satisfying (18) also satisfies (19). But (19) has a unique 
solution, namely, 

X =R -\f jg . 

(Here (14) is used.) Since the latter function belongs to RH^,, equality holds in 
(5). Thus from (4) and Nehari's theorem 

a = x = ||r* || . 

Therefore X=R-af jg . 
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To prove the third assertion in the theorem statement start with (2) and (3) 
to get 

T X -T 2 Q = T 2i (R-X). 

To prove that T \-T 2 Q is all-pass, it suffices to show that R -X is all-pass. But 
R-X = X/ /g and 

I / (i w ) I = I 9{j<*>) I 

from the preceding exercise. D 

Let's summarize this section in the form of an algorithm to compute a and 
the optimal Q . 

Step 1. Do an inner-outer factorization 
T 2= T 2iT 2o ■ 

Step 2. Define 

R := T 2 ~ l T 1 
and find a minimal realization 

R (s ) = \A , B , C , 0] + (a function in RH^). 

Step 3. Solve the equations 

AL C + L C A T = BB T 
A T L + L A = C T C . 

Step 4. Find the maximum eigenvalue X 2 of L c L and a corresponding eigen- 
vector w . 

Step 5. Define 

f (s)=\A, w, C,0] 
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g(8)=\-A T ,\- 1 L w,B T ,0] 
X =R -\f /g . 

Step 6. Set a=\ and Q =T 2o 1 X. 



Example 1. 

The algorithm applied to 
«+l 



n ' 10s +1 ' ty ' 



(« -!)(« -5) 

(s +2f 



goes like this: 



Step 1. 



T 2i (s) 



(s-l)(s-5) 
(a +l)(s +5) 



» r 2o( s 



(g +l)(s +5) 
(« +2) 2 



Step 2. 



v ' (10s+l)(s-l)(s-5) 



1 
5 



A -- 

C = fl 1 



, 5 



-6/11 
90/51 



Step 3. 



.1488 -.1604 
-.1604 .3114 



. L 



.5 .1667 
.1667 .1 



Step 4. 



X = .2021, 



1 
.7769 
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Step 5. 

, , x .2231s -4.223 , , -.2231s -4.2231 

/ ( s ) = —, — Wf — ^r > 9 (* ) 



(s-l)(s-5) ' " y ' (s+l)(s+5) 

X(s) = 3.021 ( g +l)( s + 5 ) 
v ; (10s +l)(s +18.93) 



Step 6. 

(*+2) 2 



a = .2021, Q (s ) = 3.021 



(10s +l)(s +18.93) 



6.3 A Single-Input, Single- Output Design Example 

This section illustrates an application of the previous one to a very simple 
design problem. It is emphasized that the design problem is not meant to be 
entirely realistic; in particular, it lacks bandwidth constraints on the controller 
and ignores stability margin. 

Consider the single-loop feedback system in Figure 1 and assume P is 
strictly proper and K is proper. The transfer function from reference input, w , 
to tracking error, z , is the sensitivity function 

S :=1/{1+PK). (1) 

Suppose w has its energy concentrated on a frequency band, say [0,w x ]. Then an 
appropriate performance specification would be 

| S (j u>) | < e for all w in [0,Wj] , 

where e is pre-specified and less than 1. For example, e™.01 would provide less 
than 1% tracking error of all sinusoids in the frequency range [0,Wj]. To simplify 
notation slightly, define the L^-function 

X(j'w) := 1 if | w | < wj 

:= if | w | > Wj . 

Then the performance specification is 

||X5|| 00 <£. (2) 



C'h. (5 



((' 



<>^0-H 



Figure 6.3.1. Single-loop feedback system 



dl) 



■20* 




Figure 6.3.2. Bode plot of II 
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We consider the following design problem: given P , u> v and e, design K to 
achieve specification (2) subject to the constraint of feedback stability. 

Figure 1 looks like Figure 4.4.1 with G := -P . Thus Theorem 4.4.1 yields 
a formula for all stabilizing controllers. Bring in a coprime factorization of -P : 



-P =N/M 


(3) 


MX - NY = 1 . 


(4) 


1, N, M, X, Y all eEH ro , 


. Then all proper stabilizing K 's are given by 



the formula 

K =(Y-MQ)/(X-NQ) (5) 

Q e RH^ . 
Substitute (3) and (5) into (1) and simplify using (4) to get 

S = MX - MNQ . (6) 

So the design problem is reduced to determining a function Q in RH^ such that 

MMX-MNQ^Kt. (7) 

The left-hand side of (7) has the form 

but the T i 's belong to L^ instead of RH^ as required in the previous section. 
So let's approximate x by an RH^-function W k where 

W(s):= (.Olwf 1 * +l)/(.lwf 1 s +1) . 

The piecewise-linear magnitude Bode plot of W k is shown in Figure 2. Its 
characteristics are that it is nearly unity on the operating band [0,0^] (in fact to a 
decade above) and then it drops off to -20k db at high frequency. The integer k 
remains unspecified at this stage; it's regarded as a design parameter and its 
function is explained below. 

We now observe that if || W k S || 00 <e, then \\xS {{^ is less than e, or at least 
not much larger than e. This is because | W (j lj) | k and x(j v) are nearly 
equal on the operating band. Using (6) again we conclude that the design 
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problem reduces to this: find k >1 and Q in RH^ such that 

\\W k [MX-MNQ)\\ O0 <i, 
i.e. 

I|ri-r 2 <?|| 00 < £ , ( 8 ) 

where T l :=W k MX , T 2 :=W k MN. Clearly T t and r 2 GRH TO . 
As in (1.1) define 

a^inffllr^QlLrQeRKU. 

Here a depends on the integer k , so let's write a k . The following result justifies 
the introduction of k . 

Lemma 1. If P has no zeros on the imaginary axis in the frequency range 
[0,100w x ], then 

lim a k = . 

k ->oo 

Proof. Let Xi(j w) equal 1 up to w=100wj and beyond. We have 

a k = inf {|| W k M{X-NQ ^ : Q GRH^} . 
For an arbitrary 6>0 choose a function Q in RH^ such that 

IIXi(X-^Q)|| 00<< 5. (9) 

It's possible to do this because N has no zeros on the imaginary axis in the fre- 
quency range [O.lOOwj]; so Q can be chosen to approximate X /N over this seg- 
ment of the imaginary axis. (A rigorous justification of Q 's existence uses 
Runge's theorem.) Now we get 

<\\M W^maxiWx, W'iX-NQ)^, \\{l-Xi)W k (X-NQ)\U . (10) 
Since || W k H^^l we have from (9) 

WxtW'iX-NQ^KS. (a) 

Also 
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Wil-xJW^X-NQ)^ < \\X-NQ\U\(l- Xl )W'\\ o0 . (12) 

Using (11) and (12) in (10) gives 

a k < HMIL max(<5, \\X -NQ HJKl-Xi)^* ID • 

It follows from Figure 2 that 

lim ||(l-Xi)W*Hoo = 0. 
k -+oo 

Thus for sufficiently large k , a k < \\M W^S. Since S was arbitrary, we conclude 

that a k — >0. D 

In view of Lemma 1 the design problem reduces to choosing k such that 
a k <e and then finding a Q in RH^ to satisfy (8). There remains one hitch 
though: it's not true that T 2 (j w)^0 for all 0<w<oo; in particular, T 2 is 
strictly proper because P is. So in fact the infimum 

infdlTVr^lLtQGREU (13) 

is not achieved. 

Let's see how to approach the optimization problem (13) when P has neither 
poles nor zeros on the imaginary axis. Then T 2 (j oj)^0 for all 0<w<oo, but 
T 2 (yoo)=0. Introduce a polynomial 

V(s) = (s+l)l 

where the integer / equals the relative degree of T 2 , number of poles minus 
number of zeros. Then T 2 V is proper but not strictly proper. Instead of (13), 
consider 

inf{||r 1 -r 2 l'Q 1 || 00 : Q.eRHoo}. 

The theory of the previous section applies to this problem, and there exists a 
unique optimal Q v Now define Q := VQ v Then Q is stable but not proper. 
We can approximate Q on the operating band [0,0^] by rolling off at high fre- 
quency, say a decade above Wj: 

Q a (s):=Q (s )/(.Wf 1 S +1)' . 
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Let's recap by listing the steps in the design procedure. The input data are 
P , Wj, and e, and P has neither poles nor zeros on the imaginary axis. 

Step 1. Do a coprime factorization of -P : 
~P =N/M 
MX - NY = 1. 

Step 2. Define the weighting function 

W(s) = (.Olwf 1 * + l)/(.lwf 1 s +1) 
and initialize k to 1. 

Step 3. Set 

T 1 = W k MX 

r 2 = W k MN 

V(s) = {s+1) 1 

I = relative degree of P . 

Step 4. By the method of Section 2 compute 

a k =mm{||r l -r 2 FQ 1 || 0O :Q 1 eRH 0O }. 
If a k > e > increment k by 1 and go back to Step 3. Otherwise, continue. 

Step 5. By the method of Section 2 compute the function Q 1 in RH^ such that 

a k =||r 1 -r 2 vg 1 || 00 . 

Step 6. Define 

e«(«)= V{ B )Q 1 (8)/(.luj 1 - 1 s+iy 
K =(Y-MQ a )/(X-NQ a ). 
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How the performance specification (2) is (approximately) achieved is sum- 
marized as follows. We have in succession 

IIxS|Ico = IIx/(i+p^)IL 

= \\ X {MX-MNQ a )\\ 00 
KsMMX-MNQJWv, 
< e . 

Example 1. 

This example illustrates the above procedure for the nonminimum phase 
plant 

(s+l)(s 2 +s+l) 
and the performance specs w 1 =0.01, e=0.1 (-20 db). Thus we are to achieve less 
than 10% tracking error up to 0.01 rad/s. 



Step 1. 



N =-P, M =1, X =1, Y = 



Step 2. 



W{s) 



s+1 
10s +1 



Step 3. 



Ti(*) = 



Ti(*) 




(' -!)(« -2) 

10S +1 J ( s+ l)( s 2 + s+1 ) 



V{s) = s+1 
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Step 4. 

«i = .2299 
a 2 = .05114 
Thus a 2 <e. 



Step 5. 



Q 1 {,) = -6.114 (^+-3613)( g 2 +a+il 
(s +4.656)(s +1) 2 



Step 6. 



O (j1 - 6111 (s+-3613)( S 2 +s+l) 
*° y ' ' (s +4.656)(s +1)(10« +1) 

if ( s ) = .6114 ^ S +- 3613 )( s + 1 )( s 2 + s + 1 ) 



{s +.004698)(s +.5280)(s 2 +5.612s +9.599) 



The Bode magnitude plot for this design is shown in Figure 3. The design 
didn't quite meet the spec ( | S (.01 j) | =-18.4 db); this is because of the approx- 
imations made during the procedure. 



Notes and References 

The question of when the model-matching problem has a solution is dealt 
with in, for example, Francis (1983). 

An example of the scalar-valued model-matching problem is the weighted 
sensitivity problem. This was solved in Zames and Francis (1983) and Francis 
and Zames (1984). Lemma 2.2 is from Glover (1984) and the proof of Theorem 
2.1 is adapted from the elegant paper of Sarason (1967). The nice state-space 
formulas for / and g (equations (2.12) and (2.13)) are due to Silverman and Bet- 
tayeb (1980). 

The design problem of Section 3 can be made more realistic by incorporating 
global bounds on | S \ , as for example in O'Young and Francis (1985), or 
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bounds on the complementary sensitivity function 1-S , as in Foo and 
Postlethwaite (1984), Kwakernaak (1985), and Verma and Jonckheere (1984). 
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Figure 6.3.3. Magnitude Bode plot of S, Example 1 



CHAPTER 7 



FACTORIZATION THEORY 



The purpose of this chapter is to develop some basic tools required for the 
solution of the model-matching problem in the matrix-valued case. 



7.1 The Canonical Factorization Theorem 

Consider a scalar-valued real-rational function G {s ) which is proper and has 
no poles on the imaginary axis; thus G ERL^. Suppose in addition that G has 
no zeros on the imaginary axis nor at infinity. Then G~ 1 GRL 00 too. Now con- 
sider the problem of factoring G as G = G + G _ where G + has all its poles and 
zeros in Re s >0 and G _ has all its poles and zeros in Re s <0. Furthermore, 
we require that G + and G _ be proper and have proper inverses. When does such 
a factorization exist? A moment's thought will lead to the conclusion that G has 
such a factorization iff it has the property 

{no. poles in Re s <0} = {no. zeros in Re s <0}, 

or equivalently 

{no. poles} = {no. zeros in Re s <0} + {no. poles in Re s >0}. 

The purpose of this section is to derive the analogous condition in the matrix 
case and give a procedure for doing such a factorization. 

Let G(s) be a square matrix such that G , G^ 1 £RL 0O . Thus G and its 
inverse are proper and have no poles on the imaginary axis. Our goal is to factor 
G as G = G + G _, where the factors G + and G _ are square and have the proper- 
ties 

G_, G_~ GRH^ 

G-^G^reRH^. 
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The latter condition means that G + and its inverse are proper and analytic in 
Re s <0. For ease of reference let's call a factorization as just described a canon- 
ical factorization of G . 

We begin with a minimal realization, 

G {s ) = [A , B , C , D } . 
Since G (oo)=jD and G^GRL^, we see that D is invertible. Define 

A x := A - BD~ l C 
and write the state-space equations for G : 

x = Ax + Bu 

y — Cx + Du . 
Re-arrange to get y as input and u as output: 

x = A x x + BD^y 

u =-~D~ 1 Cx + D~ x y . 



Thus 



G{s)- l = \A*,BD-\~D- l C,D- 1 }. (1) 



Next we recall the notions of modal subspaces. Suppose A is of dimension 
n X n . Let a(s ) denote the characteristic polynomial of A and factor it as 
a(s ) = a_{s )a + (s ), where a_ has all its zeros in Re s <0 and a + has all its zeros 
in Re s >0. (There are no zeros on the imaginary axis.) Then the modal sub- 
spaces of R n relative to A are 

X_(A ) := Ker a_{A ) 

JL+{A ) := Ker a + {A ) , 

where Ker denotes kernel (null space). It can be shown that X_(A ) is spanned 
by the generalized (real) eigenvectors of A corresponding to eigenvalues in 
Re s <0; similarly for X + (.A ). These two modal subspaces are complementary, 
i.e. they're independent and their sum is all of R n . Thus we write 

R» = X_(A ) © X + (A ) . 
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(The two subspaces are not orthogonal in general.) 

Bases for modal subspaces can be computed using standard numerical linear 
algebra. For example, suppose it's desired to compute a basis for X_(A ). 
Transform A to real Schur form, ordering the eigenvalues with increasing real 
part. Then partition the Schur form as 



A, 



(2) 



where A 1 has all its eigenvalues in Re s < and A 4 has all its eigenvalues in Re 
s >0. Let T be the orthogonal transformation matrix, i.e. T AT equals 
matrix (2), and partition T conformably: 

T =[T 1 T 2 ). 

Then X_(A ) = Im T 1( the column span of T x . 

Now consider the two modal subspaces X_(A x ) and X + (A ). The former is 
associated with left half-plane zeros of G and the latter with right half-plane 
poles of G . 

Theorem 1. G has a canonical factorization if X_(A x ) and X + (A ) are comple- 
mentary. 

Since the proof is constructive and is used several times in the sequel, it's 
useful to present it as an algorithm. 

Step 1. Obtain real matrices T l and T 2 , each with full column rank, such that 

X.(4 x ) = ImT 1 (3) 

X + (A) = Imr 2 . (4) 

Define T := [T x T 2 ] and note that T is square and nonsingular by the 
hypothesis of the theorem. 



Step 2. Introduce the partitions 

A x A 2 



T~ l AT = 



A 3 At 



(5) 
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T -i B 



Bi 



B, 



(6) 

CT = [C x C 2 \ (7) 

corresponding to the above partition of T , e.g. A t is square and its dimension 
equals that of X_(.A x ). 



Step 3. Define 

G + {s):=[A 4 ,B 2 ,C 2 ,D] 
G_{s):=[A v B 1 ,D- 1 C v I\ 



(8) 
(9) 



The proof that these steps do indeed result in a canonical factorization uses 
the following two exercises. 

Exercise 1. In (5) show that A 2 = 0, A t is stable, and A 4 is antistable (all 
eigenvalues in Re s >0). (Hint: use (4).) 



From the partitions (5)-(7) we have 



T' 1 A X T 



A 1 -B 1 D' 1 C 1 -B l D~ l C 2 
A 3 -B 2 D~ 1 C 1 A i -B 2 D~ l C 2 



Exercise 2. Show that A 3 ~B 2 D ^ C 1 = 0, A 1 -B 1 D~ 1 C 1 is stable, and 
A i -B 2 D~ 1 C 2 is antistable. (Hint: use (3).) 



It follows from the first exercise that G _p" , GLERH^. Also, since 

G + {s)- 1 = [A 4 -~B 2 D- 1 C 2 ,B 2 D- 1 ,-D- 1 C 2 ,D' 1 } 

G_{s)- 1 = [A 1 ~B 1 D~ 1 C 1 ,B 1 ,-D- 1 C v I], 

we conclude from the second exercise that Gj 1 , (G+ 1 j^GRH^,.. Finally, the 
verification that G = G + G „ is as follows: 
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G + (s )G_{ 8 ) = {A 4 , B 2 , C 2 ,D}X\A v B v D~ l C v 1} 



A 1 

B 2 D~ 1 C l A 4 



Bo 



[Ci C 2 ],D 



A 1 

A 3 A 4 



Br 
B 



2 J 



, [C?i c 2 ],/? 



= [r-Ur, r- 1 ^, ct,d 



[A,B, C,D 



G(s) 



The converse of Theorem 1 is true too, but we shan't need it. 



7.2 The Ifamiltonian Matrix 

In this section we examine the modal subspaces relative to a certain type of 
matrix, namely, a Hamiltonian matrix of the form 



H 



A -iJ 
-Q -A T 



(1) 



Here A , Q , and R are real n Xn matrices, Q and R are symmetric, and R is 
either positive semi-definite or negative semi-definite. The modal subspaces of H 
live in R 2n . 

We consider first the simpler case where R ==0 and A is stable. The 
Lyapunov equation 

A T X + XA + Q = (2) 



has a unique solution X . 
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Theorem 1. (R =0, A stable) The modal subspaces relative to H are 
X + (#) = Im 



X_(#) = Im 



Proof. Define the nonsingular matrix 



/ 
X I 



Then 



T -i HT 



A 

-A ' J 



x + (r- 1 i?r) = im 



Hence 



X + (i/)= TTL + (T- X HT) 
= Im T 



= Im 

Similarly for X^(ff ). D 



Now we turn to the general case. It is claimed that the spectrum of H is 
symmetric with respect to the imaginary axis. To see this, introduce the 
In X2re matrix 



-I 

1 



1 u 

having the property J 2 =-J. Then 
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J' X HJ =-JHJ =-H T . 
Thus H and -H T are similar. Hence X is an eigenvalue of H iff -X is. 

Now assume H has no eigenvalues on the imaginary axis. Then it must 
have n eigenvalues in Re s <0 and n in Re s >0, i.e. the modal subspaces both 
have dimension n . 



Theorem 2. Assume H has no eigenvalues on the imaginary axis and (A ,R ) is 



stabilizable. Then X_(.ff ) and Im 



are complementary. 



Proof. Introduce a 2n Xn real matrix T such that 

XL(ff) = Im T 
and partition it as 



(3) 



T = 



Tx 



{nXn blocks). Then X_(if ) and Im 

First it is claimed that 
T?T 2 = T^T,. 
There is a stable n Xn matrix H _ such that 

HT = TH_ . 
Again, bring in the 2n X2n matrix 



are complementary iff T \ is invertible. 



(4) 
(5) 



/ 



-I 

1 



and pre-multiply (5) by T T J : 

T T JET = T T JTH__ . 
Now, JH is symmetric, hence so is the right-hand side of (6): 

(T T JT)H_ = H^T T J T T 



(6) 
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= -Hl{T T JT). (7) 

Since H_ and (-Hi) have disjoint spectra, the unique solution of (7) (a 
Lyapunov equation) is T T JT =0. But this is equivalent to (4). 

The next claim is that Ker T t is invariant under H _. Let x £Ker T v Pre- 
multiply (5) by [I 0] to get 

AT 1 -RT 2 = T X H_. (8) 

Pre-multiply this by x T T 2 and post-multiply by x to get 

-x T T%RT 2 x = x T T^T x H_x . (9) 

But from (4) the right-hand side equals 

x T T?T 2 H_x = . 
Thus the left-hand side of (9) equals zero, i.e. 

RT 2 x = . (10) 

We just used the sign semi-definiteness of R . Post-multiply (8) by x and use 
(10) to get T 1 H_x =0, i.e. H _x £Ker T v 

Finally, to prove that T x is invertible suppose, on the contrary, that 
Ker Tj^O. Then H _ restricted to Ker T 1 has an eigenvalue X and a correspond- 
ing eigenvector x : 

H_x = \x (11) 

Re X < , 0^ zeKer T x . 
Now pre-multiply (5) by [0 /]: 

-QT 1 -A T T 2 =T i H_. (12) 

Post-multiply this by x and use (11) to get 

{A T +\)T 2 x =0 . (13) 

Then (10), (13), and stabilizability of (A ,R ) imply that T 2 x =0. (Recall that 
stabilizability implies 

rank [A -/u. R ] = n 

for all Re fi>0.) But if T x x =0 and T 2 x =0, then Tx =0, which implies x =0, 
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a contradiction. D 

The Riccati equation associated with H is 

A T X + XA - XRX + Q = . (14) 

Observe that (14) reduces to (2) when R =0. Theorem 2 is related to (14) in the 
following way. 

Corollary 1. Under the same assumptions as in Theorem 2, there exists a 
unique matrix X such that 



X_(ff) = Im 



(15) 



Moreover, X is symmetric, it satisfies (14), and A -RX is stable. 



Proof. Continuing with the notation in the previous proof, define 
X :— T 2 T { l . Then (15) is immediate from (3): 



X_(i7) = Im 



Im 



Im 



T 1 



Uniqueness is easy too, because 



Im 



I 



Im 



X, 



iff X 1 =X 2 . 

To prove that X is symmetric, start with 
XT 1= T 2 . 
Pre-multiply by T^ to get 
T[XT 1 = TfT 2 . 



(16) 
(17) 
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Now take transpose of (16) and post-multiply by T 1 to get 

T[X T T 1 =T^T 1 . (18) 

Equations (4), (17), and (18) together with nonsingularity of T x imply X T =X. 
Next, rewrite equation (5) as 



H 



i.e. 



H 



T, 



r x ff„ 



TyH^T^ 



Pre-multiply by [X -7] to get 
[X -I)H 



0. 



(19) 



This is precisely the Riccati equation. 

Finally, pre-multiplication of (19) by [I 0] gives 

A ~RX = T X H_T{ 1 , 
proving that A ~RX is stable since H _ is. D 

In general the Riccati eq nation (14) has several solutions, only one of which 
satisfies (15). 



7.3 Spectral Factorization 

Consider a square matrix G [s ) having the properties 
G ,G~ (ERL^ 
G~ = G 
G (oo) > . 



(la) 
(lb) 
(1c) 



Such a matrix has pole and zero symmetry about the imaginary axis. Our goal is 
to factor G as 
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G = G_~G_ 

G _,G _ G RHqq . 

This is called a spectral factorization of G and (?_ is a spectral factor. 

Theorem 1. Every G satisfying (l) has a spectral factorization. 

Proof. The proof is constructive. We begin by factoring G as 

G{s) = D + G 1 (s)+G 2 (s), (2) 

where D := G (oo), G, belongs to RH^ and is strictly proper, and G 2 ~ 
belongs to RH^ and is also strictly proper. It's worth mentioning in passing that 
such a factorization can be done using state-space methods, starting with a reali- 
zation of G and doing similarity transformation: 

G{s)=[A,B,C,D] 



A t 
A A 



B 9 



,[C l C 2 },D 



(A j stable, A 4 antistable) 



= Z> + {A v B v C v 0} + {A 4 ,B 2 , C t ,0]. 
The condition G~=G implies that 



i.e. 



G, 



G, 



G, 



G 1 + G 2 , 



i ~ G 2 — G x - G 2 ~ . 

The two sides of this equation are analytic in different half-planes and are strictly 
proper. By Liouville's theorem both sides equal zero. Thus G 2 =G ~ , so from 
(2) 



G=D+G l + G~. 



(3) 



Ch. 7 



95 



Bring in a minimal realization 
G 1 (s) = [A v B 1 ,C v O}. 



Then 



and 



wher 



Gr(s)=[-A{,-C[, Bf.O] 



G{s) = [A,B, C,D 



A :-- 



B := 



Ay 

-A? 



fl, 



(4) 



(5) 



C :=\C l B?]. (6) 

We are now set up to do a canonical factorization of G . From (4) we see 



that 



X + (A ) = Im 



(7) 



As in Section 1 bring in 

A x = A - BD~ l C 



Ay-ByD^Cy -B yD ~ l B \ 



C{D- X C X -\A x -B x D- x Gtf 



(8) 



Comparison of (8) and (2.1) shows that A x is a Hamiltonian matrix. Note that 
A x has no eigenvalues on the imaginary axis (this follows from the assumption 
G~ GRHqq), B x D~ l B j is positive semi-definite, and 

(Aj-B^-'CLfl^-'flf) 

is controllable (because (A v B x ) is). It follows from Theorem 2.2 and (7) that 
X_(A x ) and X + (yl ) are complementary. So by Theorem 1.1 G has a canonical 
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factorization. 

By virtue of Corollary 2.1 there exists a matrix X such that 



X_(A x ) = Im 



Defining 



T := 



I 

X I 



we get 



T^AT = 







? -A{ 
(question mark means irrelevant) 



T ~i B 



CT 



-(Cf+XBJ 



\C,+BlX B\ 



By analogy with (1.8) and (1.9) we have G = G + G _, where 

G + (s ) := [-A f , -(Cf +XB X ),B? , D } 

G_{8):={A 1 ,B 1 ,D-\C 1 +BTX),I} . 

Notice that G + = G_~D, so that G = G ~DG _. Finally, since D is positive 
definite (by (lc)) it has a square root D l ' 2 . Redefining G _ as 

G_(s )=[A l ,B l , D-WiCj.+BfXlD 1 ' 2 } , (9) 

we get the desired spectral factorization G — G ~ G _. U 

Exercise 1. Prove that if (A , B ) is controllable and D is positive definite, then 

[A-BD- l C , BD~ l B T ) 
is controllable. 
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A matrix G satisfying (1) also has a co-spectral factorization: 

G = G_G~ 

G _,G ~ 6 ^H^ . 
To get such a factorization, do a spectral factorization of H :=G : 

H = H~H_ 

H_,HZ 1 G RH ro . 
Then set G_=H? . 

In applications one may want to do spectral factorization of a matrix given 
in special form. The method of proof in Theorem 1 can be used to solve the next 
exercises. 

Exercise 2. Suppose G gRH^ and G (j oj) has full column rank for all 
0<w<oo. Derive the following procedure for a spectral factorization of G~G : 
obtain a minimal realization, 

G(s)=[A,B,C,D}; 

define 

P ~D T D 

A~BD- 1 D T C -BD_- l B T 

-C T C+C T DD~ l D T C ~{A-BD_- l D T C) T 

and find the unique matrix X such that 



H :-- 



X_(H) = Im 



then 



[A , B , P~ l l 2 {D T C +B T X), D 1 ' 2 } 
is a spectral factor of G~G . 



Exercise 3. Suppose G (s )=[A , B , C , I } is a minimal realization. Assume A 
is antistable and A -BC is stable. Derive a formula for a spectral factor of 
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G~G . 



Exercise 4. Suppose G GRH^ and 7>||G H^. Derive a formula for a spectral 
factor of q 2 -G~G starting from a minimal realization of G . 



7.4 Inner-Outer Factorization 

A matrix G in RH^ is inner if G~G =1. This generalizes the definition 
of an inner function. Observe that for G to be inner it must be tall (number of 
rows > number of columns). A useful property of an inner matrix is that its 
Laurent operator preserves inner products: 

<Gf ,Gh> = <f ,h>, f ,h£L 2 . 

It follows from this that pre-multiplication of an L^-matrix by G preserves 
norms: 

F€h O0 =^\\GF\\ oo = \\F\\ O0 . 
An example of an inner matrix is 

«+l 



s+V2 
1 



+V2 



A matrix G in RH^ is outer if, for every Re s >0, G (s ) has full row rank; 
equivalents, G has a right-inverse which is analytic in Re .■>■ >0. This general- 
izes the definition of an outer function. An outer matrix is wide (number of rows 
< number of columns). Of course, if G is square and 

G ,G~ e RH^ , 

then G is outer. Another example is that [F G } is outer if F and G are left- 
coprime RH^-matrices. 

An inner-outer factorization of a matrix G in RH ro is a factorization 

G = G, G„ 
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G { inner, G outer. 

It's a fact that every matrix in RH M has an inner-outer factorization. We shall 
only need the following, slightly weaker result. 

Theorem 1. If G is a matrix in RH^ and rank G (j w) is constant for all 
0<w<oo, then G has an inner-outer factorization with the outer factor being 
right-invertible in RH^. 



Proof. By Lemma 4.3.1 there exist square matrices G v H , K in RH^ satisfy- 
ing the equation 



K 





G l 


G = H 





having the properties 


ff-Sff- 1 € RH TO 


G j nonsi 


ngular. 



The rank assumption on G implies that G x (j u>) is nonsingular for all 0<w<oo, 
i.e. G f 1 GRL^. Define 



H 



Gi 




so that 

G = [F Q)K . 
Now F ~F has the properties 

F~F,(F~F)- 1 eRL^ 

(F~F)~ = F~F 

(F~F)(oo)>0, 
so it has a spectral factorization by Theorem 3.1: 

F~F =F ~F 

F ,F ~ l e RH TO . 
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Define 

G, := FF ~ l 
G„ := [F 0}K . 
Then G, is inner, G is right-invertible in RH^, and G =G, G„ . D 

Inner-outer factorization is relatively easy when G (jw) has full column rank 
for all 0<w<oo: let G„ be a spectral factor of G~G and then set G, -.— GG,,' 1 . 



Example 1. 

To do an inner-outer factorization of 
1 



G(s) 



8+1 

10s 

s +2 
s-1 



s+1 



we follow Exercise 3.2 to get a spectral factor of G~G . We have 
G(s)=\A,B,C,D} 



A = 



G = 



-3 -2 
1 

1 2 
-20 -20 

-2 -4 



, B 



, D 




10 
1 



Then 



D:=D T D 

H : = 
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T/or i r< T nn-ln T 



-BD~ l B 



IdT 



G J G+C J DD^D 1 C -{A-BD^D 1 C) 



.0198 

-2 



.0099 



1 1 



-2 -7.9604 -.0198 
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and 



XL(#) = Im 



X 



where 



X = 



27.25 30.43 
30.43 36.09 



Thus 



G (s) = \A,B, D~ l l 2 (D T C +B T X),D l l 2 } 

= 10 Q5 ( g +- 9 837)( S +.2861) 
(s +!)(« +2) 



Finally 



G i (s)=G(s)G (sT 1 
1 



10.05(5 +.9837)(s +.2861) 



s +2 
10s (s +1) 

L(*-i)(«+2)J 



A matrix G is said to be co-inner or co-outer if G T is inner or outer 
respectively. A co-inner-outer factorization has the form 



G — G cg G cl 



G co co-outer, G ct co-inner. 
An inner-outer factorization of G T yields a co-inner-outer factorization of G 



7.5 J-Spectral Factorization 

In this section we look at a rather special factorization required in Section 
8.3. We start with a real-rational matrix G x (s ) having the properties 

G i is strictly proper, (la) 

G i is analytic in Re s <0 , (lb) 

IM < i • (ic) 
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Define the matrices 



G :-- 



J := 



I 

1 
-I 



(2) 



(3) 



(not the same J as in Section 2). Our goal is to achieve the following J -spectral 
factorization of G : 

G~JG = G _~/G_ ( 4 ) 

G _,G Z G RHqq . 
Bring in a minimal realization 
G 1 (*)=|^i,5i, Cx.O]. 
Note that A 1 is antistable. Defining 



A := 



B := 



C 



-Al C*C X 



cf 

B t 
C x 



-J? J 



we have 



(G~JG)[s)= [A,B,C,J] 



Also 



A ~BJ~ l C 



-A{ 
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Bring in the controllability and observability gramians: 
A X L C +L C A[ =B y B[ 
A T X L + L A 1 =C?C l . 
As in Theorem 2.1 we have 



X_(A x ) = Im 



I 



X + (A ) = Im 



, ^*. + 



Exercise 1. Show that assumption (lc) implies that X_(A x ) and X + (A ) are 
complementary. 



From this exercise we can proceed with a canonical factorization of G ~JG 
Defining 



T := 



L, I 



N :=(I-L L C T\ 
we get (after some algebra) 



T~ l AT = 



T' y B = 



~NA fflT 1 
? A, 



NC[ -NL B 1 
-L c NCl N T B, 



GT 



OiL c 



C 



B l ~ B 1 L o 



Then we obtain from (1.8) and (1.9) that G~JF = G + G _, where 



G + {s) = [A u \-L c NCl N T B 1 ], 



-BfL 



\ 1J o 



,J] 



G_{s)= [-NA ?N-\ \NC I -NL B t ] 



C X L C 
Bl 



,1} 



(5) 
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It can be checked that G + = G~J, so that G~JG = G~JG_. For future 
reference we record that 



G_(«)- 1 =[-A 1 T ,[iVC 1 T -NL.BJ, 
which follows from (5) and some algebra. 



-CiL e 



,1), 



(6) 



Notes and References 

This chapter is based primarily on Bart, Gohberg, and Kaashoek (1979), 
Doyle (1984), and Ball and Ran (1986): Theorem 1.1 is from Theorem 1.5 of Bart 
et al. (1979); Theorem 2.2 and Corollary 2.1 are based on Section 2.3.3 of Doyle 
(1984); Section 3 is based on Section 2.3.4 of Doyle (1984); and Section 5 is from 
Ball and Ran (1986). 

Golub and Van Loan (1983) is an excellent source for numerical linear alge- 
bra, in particular the Schur decomposition. For the standard results on 
Lyapunov and Riccati equations see, for example, Wonham (1985). A standard 
reference for spectral factorization is Youla (1961). For a general treatment of 
inner-outer factorizations see Sz.-Nagy and Foias (1970). 



CHAPTER 8 



MODEL-MATCHING THEORY: PART II 



This chapter treats the model-matching problem in the matrix-valued case. 



8.1 Reduction to the Nehari Problem 

The model-matching problem is a lot harder when the T,- 's are matrix- 
valued functions than it is when they are scalar-valued. This section develops a 
high level algorithm for reducing the model-matching problem to the Nehari 
problem of approximating an RL^-matrix by an RH^-matrix; the Nehari prob- 
lem will then be treated in Section 3. 

Throughout this chapter the rank conditions of Theorem 6.1.1 are assumed 
to hold. 

The problem is sufficiently hard that we shall content ourselves with accom- 
plishing the following: to compute an upper bound 7 for a such that 7- a is less 
than a pre-specified tolerance; and then to compute a Q in RH^ satisfying 

\\ri-T 2 QT 3 \\ 00 < 1 . (1) 

Such a Q may not be optimal, but it will be as near optimality as we wish. 

To see the development more clearly, let's first do the case T 3 =I . Bring in 
an inner-outer factorization of T 2 , 

T 2 = Ui U 

U { inner, U g outer , 

and define the RL^-matrix 

Y :=(I-U t U^T,. 

If 7 is a real number greater than || 1^ ||oo» tnen tne matrix 7 2 - Y~Y has a spec- 
tral factor Y n . Define the RL^-matrix 
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r : = f/r^v 1 



Thus R depends on 7. 



Theorem 1. (T 3 =7) 

(i) a = inf{ 7 : || Y || 0O <^, dist(J2 , RHJ<1} 

(ii) Suppose 

-7 > a, Q,X GRH^, 

P-*lloo<l 
X = U e QY~ l . 

Thenlirx-r^l^^^ 



(2) 
(3) 



Part (i) of the theorem affords a method for computing an upper bound 7 for 
a; then part (ii) yields a procedure for computing a nearly optimal Q . A partic- 
ularly easy case of the theorem is when T 2 is square and nonsingular. Then U t 
is square, Y =0, Y =^I , R — 7 -1 C/,- ~ T 1( and part (i) reduces to 

a = dist(C/ I ~r l ,RH 0O ), 

i.e. a equals the norm of the Hankel operator with symbol U t ~T v This is just 
like the scalar-valued case of Section 6.2. 

The proof of Theorem 1 requires two preliminary technical facts. 
Lemma 1. Let U be an inner matrix and define the RL^-matrix 



E := 



U~ 
I-UU' 



Then \\EG || 00 =||G' H^ for all matrices G in RL^. 



Proof. It suffices to show that E ~E —I . But this follows easily from the fact 
that U~U=I: 



E~E = \U I-UU' 



U~ 
I-UU' 
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UU~ + {I-UU~){I-UU~) 



I . D 



Lemma 2. Suppose F and G are RL^- matrices with equal number of columns. 
If 



< 1, 



then 



and 



\G\L<1 



where G is a spectral factor of rf-G~G . Conversely, if (5) holds and 

rOL < i , 

then 



(4) 

(5) 
(6) 



<1 



Proof. We'll prove the first statement. Assume (4). Then (5) follows immedi- 
ately because 



IGIL < 



It follows in turn from Theorem 7.3.1 that rf-G^G has a spectral factorization: 
T 2 -G~G =G~G (7) 

To prove (6), define 
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e :=7- 



F 

G 



(8) 



let / be an L 2 -vector of unit norm, and define g :— G 1 f . Starting from (8) 
we have in succession 



F 

G 

F 
G 



ffllz < (T-e)||s , || 2 



9} < {l-t) 2 <9,9 > 



<g,{F~F +G~G )g > < <f <9 ,g > - e{2 1 -€)\\g || 2 2 
<g,F~Fg> < <g ,tf-G~G )g > - e(2 7 -e)||G || ro 2 . 
The last step used the inequality 

l = l|G ff|| 2 < ||G ||oollff|| 2 - 
Now using (7) we get 

11^ ll2 2 <HG ff|| 2 2 - ^-T-eJHGJI^. 
Hence 

\\FG - 1 f\\i<l-e(2' t -e)\\G \\^. 
Since / was arbitrary we find that 

\\FG ~ l \\l<l--e(2 1 -e)\\G \\^. 
Since e(2~f-e)>0, we arrive at (6). D 

Exercise 1. Prove the converse in Lemma 2. 

Proof of Theorem 1. 

(i) Let 

7 m/ :=inf {7: H^IL < 1, dist(£ , RHJ < 1} . 

Choose e>0 and then choose 7 such that a+e>'y>a. Then there exists Q in 
RH„ such that 



|r 1 -r 2 Q|| 00<7 . 
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Equivalently, from Lemma 1 



Now 



V t 



I-U, C/,' 



u~ 

I-U.U/ 



(T l -T 2 Q)\\ 0O < 1 . 



(9) 



?2 = 



u 





so (9) is equivalent to 

Y IL < T • 

This implies from Lemma 2 that 

imioo < i 

||c/rr 1 n- 1 -c/ gr - 1 || oo <i. 

The latter inequality implies 

dist(JK, f/ RH oo r o - 1 )< 1. 



(10) 
(11) 

(12) 



But U B is right-invertible in RH^ (Theorem 7.4.1) and Y is invertible in RH^,. 
Therefore 

U RHqq Y~ = RH^, , 

so (12) gives 

dist(£ , RH J < 1 . (13) 

From (10), (13), and the definition of i tn j we conclude that q in f<7. Thus 
7,vf<a+e. Since e was arbitrary, 7,„f<a. 

For the reverse inequality, again choose e>0 and then choose ^ such that 
Tm/+ e >'V>Tm/- Then (10) and (13) hold, so (11) holds for some Q in RH^. 
Lemma 2 now implies that 



V~T X -U Q 
Y 



<!■ 
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Finally, this leads (as above) to 

\\T 1 -T 2 Q\\ 0o < 1 . 

Thus a<7<7,„/+e, so a<l m} . 

(ii) This part follows from the previous paragraph. D 

We saw in Section 6.2 that 

dist(fi , RHJ = dist(i? , HJ 
for a scalar- valued R in RL^. This is true in the matrix- valued case too. 

Lemma 3. For R in RL m 

distOR , RHJ = dist(i? , HJ = ||r s || . 

Proof. We have 

dist(* , RH ro ) > dist(i? , HJ = \\T R || , 

the latter equality being Nehari's theorem. Choose e>0 and set (i:=\\y R ||. 
Then 

distK/J+e)" 1 ^ , HJ = {fi+ey^Tjt \\ 

= w+o 

< 1 . 

As we shall see in Section 3, this inequality implies there exists X in RH^ such 
that 

WiP+eyiR-XW^Kl. 
Thus 

dist(# , RHJ < p+e 

= dist(i? , HJ+ f . 
Since e was arbitrary 

dist(JZ , RHJ < dist(fl , Hoo) . □ 
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It can also be proved that the distance from R to RH m is achieved, i.e. 
there exists X in RH^ such that 

p-X|| 0o = dist(i?,RH oo ). 

Based on Theorem 1 and Lemma 3 we have the following high level algo- 
rithm for finding nearly optimal Q 's in the case T 3 =I . 

Step 1. Compute Y and 1 1 ^ j | oo - 

Step 2. Find an upper bound a t for a. 

Step 3. Select a trial value for 7 in the interval ([| j^ ||oo> a i]* 

Step 4. Compute R and \\T R ||. Then \\T R \\ < 1 iff a < 7, so increase or 
decrease the value of ^y accordingly and return to Step 3. When a sufficiently 
accurate upper bound for a is obtained, continue to Step 5. 

Step 5. Find a matrix X in RH^ such that \\R -X^ < 1. 

Step 6. Solve X = U g QY ~ l for Q in RH TO . 

Step 1 involves implementation of the procedures in the previous chapter for 
spectral factorization and inner-outer factorization. l'"or Step 2 the simplest 
bound would be ai=|| TiHoo- Then binary search could be used to iterate on -7. 
Step 6 is not too difficult if U is square; this happens when T 2 is tall, which fre- 
quently can be arranged by suitable problem formulation. When U isn't square, 
Step 6 would be more difficult. How to do Step 5 is the subject of the next two 
sections. 

Example 1. 

Let's continue with the tracking example of Chapter 3, again taking 
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W(s) = 



«+l 



10s +1 
and p=l. In Section 4.5 we obtained 

5+1 

10s +1 
T 1 (s)=T 3 ( 8 )= [ Q 

s-1 

s 2 +s+l 
T 2( s )~ s ( a _ 2 ) 

s 2 +s +1 
The matrix Q is of dimension 1X2: 

Q =[Qi Q 2 \ ■ 

Notice that QT 3 =WQ V Thus by the substitution 

T 2 ^WT 2 
we arrive at a model-matching problem with T 3 ~I , namely 
minimize || T 1 -r 2 (2 1 || 00 , C^eRE^. 
The previous algorithm produces the following results. 



Step 1. 



U,{*) = 
Y(s) 

llriL 



-5+1 

s (s -2) 



s 2 +V7s+l 

■ (g 2 +V7g+l)(g+l) 
(s 2 +s+l)(10s+l) 

s_+l 

(10s+l)(s 4 -5s 2 +l) 

= 0.1683 



s 2 (s 2 -4) 
-s(s+l)(s-2) 



Step 2. 



a i — \\T 



llloo 
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Steps 3 and 4. From the first two steps we know that a lies in the interval 
[.1683, 1]. We now use binary search on 7 in this interval, testing if \\T R || < 1. 
To locate a within, say, 3% of the length of this interval, we need five iterations: 



7 


l|r*|| 


0.5846 


0.3710 


0.3774 


0.5996 


0.2729 


0.8952 


0.2206 


1.2327 


0.2468 


1.0317 



We conclude that a lies in the interval [.2468, .2729]. Let's proceed with 
-7=.2729. Then a spectral factor of ~f 2 -Y~Y (calculated via Theorem 7.3.1) is 

s 3 +2.678s 2 +1.081s +.1075 



Y (s ) = .2539 
There follows 

R(e) = 



{s +.l){s 2 +V7s +1) 
.3938(s +l) 2 (s 2 +y / 7s+l) 



(s 2 -y/7s + l)(s 3 +2.678s 2 +1.081s +.1075) 
(The values of ||r R ]| in the above table are calculated as in Example 5.2.1.) 



Step 5. Since R is scalar-valued, Theorem 6.2.1 can be applied to find the 
closest function X in RH^. 

R (s ) = [A , B , C , D } + (a function in RH^) 



A = 



2.189 
.4569 



B = 

C = [1 1 

L r = 



1.038 
-.9291 



.2463 


-.3646 


-.3646 


.9448 


2284 


.3780 




3780 


1.095 
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X := maximum eigenvalue of L c L 
w := corresponding eigenvector = 



.3338 
1.076 



= .8952 

-.3466 
1 



X(s) = R{s)-\{A,w, C,0}X[-A T ,v,B T , O]^ 1 

= 8952 ^ 2+v ^ s +1 X S 2 + 2 - 656s +1-033) 

(s +3.108)(s 3 +2.678s 2 +1.081s +.1075) 



Step 6. 



Q =[Qi Q 2 \ 

Q 1 (s) = X(s)Y (s)U (sT 1 



= 2.273 



{s 2 +s +l)(s 2 +2.656s +1.033) 



(s +l)(s +3.108)(s 2 +\/7s +1) 
The function Q 2 is unconstrained, and hence may be set to zero. 
Finally, the controller K is computed using formula (4.5.2). 
K =[C t C 2 ] 



2 2?3 (g +l)(e 2 +s +l)(a 2 +2.656g +1.033) 
(s +3.108)(s 2 +V7s +l)(s 2 +5s -18) 
32s -1 



C 2 («) 

s"+5s-18 

Notice that C x is unstable, so the controller can't be implemented as shown in 
Figure 4 of Chapter 3. (This is a case where every stabilizing controller is itself 
unstable.) The theory guarantees, however, that C 2 contains the unstable factor 
of C v This unstable factor would be moved past the summing junction into the 
loop. 

The properties of this design are illustrated in the Bode magnitude plot of 
Figure 1. The transfer function, say H v from reference r to tracking error r -v 
has magnitude less than -10 db over the frequency range [0, .1], approximately 
the bandwidth of the weighting function W (smaller tracking error could be 



C'll. N 



I 15 



4- 



0- 



dl) -4 - 



-8- 



■12 - 



0.01 




100 



0t' 



Figure 8.1.1. Magnitude Bode plot of II [f Example 1 
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obtained by reducing the weighting p on control energy), it peaks to about 4 db, 
and it rolls off to db at high frequency, as it must for a proper controller. The 
actual quantity being minimized is the H^-norm of the transfer matrix 

H X W~ 

where H 2 is the transfer function from r to u . This norm equals the supremum 
of 

{\H 1 (jw)\* + \H 2 (jw)\y/ 2 \ W(jw)\ 
over all w. For our design this function is very nearly flat at -11.3 db. 

The generalization when T 3 f^I uses the following definitions: 

T 2 — U t U , U i inner, U outer 

Y:=(I-U t Vr)T 1 

Y = spectral factor of 7 2 - Y~Y 

T 3 Y ~ = V c0 V ct , V co co-outer, V ci co-inner 

Z :^UrT 1 Y -\l^V c rV ct ) 

Z co = co-spectral factor of I-ZZ~ 

R -^ZjUrTyY^V-. 

Notice that R ,Y ,Z ^KL^ and XeRH^; Y is a function of T 1 and T 2 ; R and 
Z are functions of 7, T v T 2 , and T 3 . The matrix Z c ~ l U is right-invertible 
over RH^ and V c0 is left-invertible over RH TO . 

Theorem 2. 

(i) a = inf{ 7 : ||r IL < % \\Z ^ < 1, dht(R , RHj < 1} . 
(ii) Suppose 

1> a, Q,X GRH^ 

\\R-X\UK1 



Ch. 8 117 

X =Z C - 1 U B QV C0 . 

Then||r 1 -r 2 gr 3 || 00 < 7 . 

The proof is analogous to that of Theorem 1 and is therefore omitted. The 
general high level algorithm is as follows: 

Step 1. Compute Y and ||r|L. 

Step 2. Find an upper bound ccj. for a. 

Step 3. Select a trial value for 7 in the interval (H^Hoo, «i]- 

Step 4. Compute Z and \\Z W^. 

Step 5. If \\Z H^j < 1, continue; if not, increase 7 and return to Step 4. 

Step 6. Compute R and \\Y R ||. Then \\Y R || < 1 iff a<7, so increase or 
decrease the value of -7 accordingly and return to Step 3. When a sufficiently 
accurate upper bound for a is obtained, continue. 

Step 7. Find a matrix X in RH^ such that \\R -XW^ < 1. 

Step 8. Solve X = Z C ~ 1 U QV C0 for Q in RH TO . 



8.2 Krein Space 

This section introduces a geometric structure which will be used to solve the 
Nehari problem. 

Let X and Y be two Hilbert spaces. There is a natural way to add them 
together to get a third Hilbert space, their external direct sum X ® Y. We shall 

H 

represent vectors in X © Y like this: I I . As a set, X © Y consists of all 
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such vectors as x ranges over X and y over Y. Vector addition and scalar mul- 
tiplication; are- idefined> componentwise, and the 'inner product is defined as follows: 



M 



^11 ■! -Iv 

= <%V X 2> + <VvV2> 



Now introduce in addition an indefinite inner-product on X © Y: 

yi riy 2 i ■ < x v x 2> ~ <2/i>«/ 2 > 

iT'Msiis indefinite because \z,,z,\ can be negative, zero, or positive, depending on 
the particular z in X © Y. A more compact way of defining [ , ] lis to introduce 
the operator / on X © Y: 

Then [z^Zj] := <z lt Jz 2 >- The external direct sum X © Y together with the 
above indefinite inner-product is called a Krein space. 

A vector z in X ffi Y is negative if [z ,z ]<0, and a subspace of X © Y is 
negative if all its vectors are negative. 

Consider an operator ,$ from Y to X. Its graph is a subspace of X © Y, 
namely, 



{(?)-}• 



It is an elementary fact (and is easy to prove) that the graph is a closed subset of 
X© Y. 

Example 1. 

Let F be a matrix in RH^ and consider the compression to H 2 of the 
Laurent operator with symbol F (i.e. the Toeplitz operator). Its graph is 



{(?) : » £I 4 



or equivalently 
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F 
I 



H 



2 > 



where we use the notation 

MH 2 := {Mg : g £H 2 } . 
This graph is a subspace of H 2 © H 2 . It's convenient to denote it by G F 

Example 2. 

If F GRL^, then the corresponding graph 



Gp : — 
lives in L 9 © H, 



H„ 



8.3 The Nehari Problem 

This section solves the Nehari problem posed as follows: given R in RL^ 
with dist(i? , RH 00 )<1, find all X's in RH^ such that \\R -X^O. Only some 
of these X 's are closest to R , i.e. satisfy 

\\R-X\\ oo = dJ3t(R t RH O0 ). 

We saw in Lemma 1.3 that the distance equals ||r^ ||, so the standing assumption 
in this section is that Hr^ ||<1. 

We may as well assume in addition that R is strictly proper and analytic in 
Re s <0, i.e. R "GRH^. Otherwise, factor R uniquely as 

R = R y ~\~ R % 

R ]~ ,R 2 6 RHqq , R j strictly proper . 

We've already observed in Chapter 5 that Y R =T R . So to solve the Nehari 
problem for R , solve it for R 1( i.e. find all Xj's in RH^ such that 
ll^i-^illoo^ 1 * and then set X — X x +R 2 . 
We need a preliminary fact. 
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Lemma 1. Let F GRL^. Then fH 2 CH 2 iff F GRH^. If F is square and 
FH 2 =K 2 , then F^GRH^. 

Proof. The implication 

F GRH TO =»FH 2 CH 2 
is easy (and has already been noted in Theorem 2.4.2). 

Suppose FH 2 CH 2 . Since each column of the matrix (s +l) _1 J belongs to 
H 2 , the same is true of (s + 1) _1 .F (s ). Therefore, this latter matrix is strictly 
proper and analytic in Re s >0. Hence F is proper and analytic in Re s >0, i.e. 
F GRH^. 

Finally, suppose F is square and F H 2 =H 2 . Then there exists a matrix G , 
each of whose columns belongs to H 2 , such that 

F{s)G(s) = {s+iy 1 I . 

This implies that F has an inverse in RH^, namely, (s + 1)G (s ). □ 



In terms of S :== R ~X , a problem equivalent to the Nehari problem is this: 
find all S"s in RL^ such that pU^^l and R ~S GRH TO . The next lemma 
gives geometric characterizations of these two conditions. Define the RL^- 
matrix 



G := 



/ R 
/ 



(1) 



Lemma 2. Let S eRL^. Then ||S Hoo^ 1 iff G S is negative, and R -S GRH C 
iff 

G s <Z G (H 2 © H 2 ) . 



Proof. Suppose ||S'|| 0O <l. A vector in G 5 has the form , 
H 2 . This vector is negative: 



Sf 



for some / in 
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(7K7) 



= \\sf ll 2 2 HI/ 112 

< (l|5||4,-i)||/ \\i 

< o . 

The converse is equally easy. 

Now suppose R ~S ERH^. Then by Lemma 1 
(S -R )H 2 C H 2 , 



so 



G, 



H, 



C 



/ R 

/ 

1 R 
/ 



S-R 
I 



H 9 



(H 2 © H 2 



G (H 2 H 2 ) 



Conversely, if 

G s C G (H 2 © H 2 ) , 
then 



H 2 C 



I R 
/ 



(H 2 © H 2 ) 



P re-multiply by [-/ R } to get 

(R-S)H 2 CH 2 , 
which implies by Lemma 1 that R ~S gRH^. U 



In view of Lemma 2 we would like to be able to characterize negative graphs 
contained in G (H 2 © H 2 ). The /-spectral factorization of Section 7.5 was 
introduced for this very purpose. Following Section 7.5 we have 
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G~/G = G yJG_ 
G _,G Z £ R-Hoo 



I 
-I 



Define 

L := GGS 1 . 
Then from (2) 

L ~JL = J 
and from (3) 

L (H 2 © H 2 ) = G (H 2 © H 2 ) . 



(2) 



(3) 
(4) 
(5) 



A square matrix M in RL^, having the property M~ JM =J is said to be 
J -unitary . Such a matrix is invertible in RL^; in fact the inverse of M is 
JM~ J ' . The usefulness of /-unitary matrices derives from the fact that (under 
mild conditions) they map negative graphs into negative graphs. The precise 
statement is as follows. 

Lemma 3. Let X be an RL^-matrix with llXl^^l. Suppose M is a J- 
unitary matrix having the properties 



MG X C L 2 © H 2 

{0} © H 2 C M (L 2 © H 2 ) . 



(6) 

(7) 



Then there exists Y in RL TO such that Hy^O and G y =MG x . 



Proof. Define 










72j 




M 






Then from (6) 






L r 2. 


H 2 


= MG X C L 2 © H 



(8) 
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so p re-mult ip lying by [0 /] we get F 2 H 2 CH 2 . Then by Lemma 1, r 2 £RH c 
We shall show that Yf 1 GRH^. This takes three steps. 

Claim #1: MGj is a negative subspace of L 2 © H 2 . 

A vector in M G x has the form, 

'Xf 



M 



f 



for some / in H 2 . Then 
'Xf) ..[Xf 



M 



7W7 



M, 



Xf 



,JM 



Xf 



Xf 



f \,M~JM\ f 



Xf 




The last quantity is <0 because G x is negative. 
Claim #2: l r 2 H 2 is a closed subspace of H 2 . 
Suppose {/ k } is a sequence in F 2 H 2 which converges to some / in H 2 . 



Then 



fk 



1% 



H„ = MG, 



for certain vectors h k in L 2 . Since {f k } is Cauchy and MG X is negative, it fol- 
lows that {h k } is Cauchy too, so it converges to some h in L 2 . Since G^ is 
closed and M ,M~ l €KL 00 , it follows that MG X is closed. Thus 



^i 



H 



2 > 



so / er 2 H 2 . 

Claim #3: r 2 H 2 =H 2 . 
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Suppose otherwise. Since Y 2 H 2 is closed, we have 

h 2 = (r 2 H 2 )-L e (r 2 H 2 ) . 

Let g be a nonzero vector in (i^H^and define 



/i 

/ 2 



M~ 



(9) 



Then (7) implies that / 2 6H 2 , so g and Y 2 f 2 are orthogonal. Now we get 
= -<g,Y 2 f 2 > 



0| I IV 2 
(/| >V 2 



Ml , |,/M 



/1 



fx/ f) 

( /* J 



from (8) and (9) 



Xf 2 
f 2 l' M ~ JM \f 2 



if 1 



,J 



Xf 2 
/ 2 



Thus 



<f v Xf 2 >-\\f 



2II2 



|/ 2 ||2 2 = <f l,Xf 2 > 

<H/lll 2 l|X/ 2 || 2 
<H/lll2l|/2ll2, 



|/ 2 II 2 < 11/ ll 



(10) 



/l 



IS 



But I is strictly negative and M * is / -unitary. This implies that , 
strictly negative too, i.e. 

11/ ill! < ll/ 2 ||2- (11) 
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Inequalities (10) and (11) are contradictory. 

It follows from the third claim and Lemma 1 that Y% 1 ERH Kl . Defining 
Y := r i r 2 " 1 , we have FgRL^ and 



Yt 



H 2 — G r 



(12) 



From the first claim G Y is negative, so from Lemma 2 ||^||oo^l- Finally, (8) 
and (12) imply that G Y = MG X . □ 

Now we have the solution to the Nehari problem as posed in terms of S . 

Theorem 1. The set of all matrices S in RL^ such that ||S||oo5il and 
R -S ERHpo is given by the formulas 



X, 



= L 



TGRH^, llrlLXl. 



Proof. First suppose 



By Lemma 2 and (5), G s is negative and 

G s C L (H 2 © H 2 ) . 
Define M:=L~\ From (13) 

IGj C H 2 © H 2 , 
so (6) holds with S substituted for X . Also 

L ({0} © H 2 ) C L (H 2 © H 2 ) 

= G (H 2 © H 2 ) from (5) 



/ R 
I 



(H 2 © H 2 ) 



(13) 
(14) 
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C L 2 © H 2 , 



so (7) holds. Noting that M is /-unitary, invoke Lemma 3 to get the existence 



of Y in RL^ such that 



\\Y\L< 1. G y =MG S . 
Since G Y C H 2 © H 2 from (14), we have by Lemma 1 that actually FGRH^. 



Define 



so that 



X l 

x 2 



H2 I Gy 



= G< 



H, 



(15) 



Pre-multiply (15) by [0 7] to get A' 2 II 2 =H 2 . Thus Xf 1 GRH^ by Lemma 1. 
Now pre-multiply (15) by [7 -5 ] to get S =X 1 X 2 ~ 1 . 
Conversely, suppose FeRH^ and ||y H^l. Then 
LG Y C L (H 2 © H 2 ) 
= G (H 2 © H 2 ) 
C L 2 © H 2 , 
and 

{0} © H 2 = G ({0} © H 2 ) 
C G (H 2 © H 2 ) 
= L (H 2 © H 2 ) 
C L (L 2 © H 2 ) . 
Invoke Lemma 3 again: there exists S in RL ro such that ||-S'|| 00 <1 and 
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G s =L G Y ■ Define 



X, 



Then 



X, 



H, = G 



s > 



so S =X t X 2 l as before. D 

The formulas in Theorem 1 yield 5 as a linear fractional transformation of 
Y . To see this, partition L as 

L j L 2 

1 = L L 

Then 

S ^(L.Y+L^L.Y+L,)" 1 . 
One possible candidate for Y is Y =0, in which case S is simply L 2 L 4 _1 . 

Let's summarize the results of this section in the form of an algorithm. The 
input is a matrix R in RL^ having the properties 

R is strictly proper 

l|r*||<i 

and the output is a matrix X in RH TO such that \\R -X Ho^l. 

Step 1. Find a minimal realization of R : 
R {s ) = [A , B , C , 0] . 



Step 2. Solve the Lyapunov equations 

AL C + L C A T = BB T 
A T L„ + L n A = C T C 
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and set N:={I-L L c y 1 . 

Step 3. Set 

Ii(*) = [A,-L e NC T , C,I] 
L 2 {s)=[A,N T B,C,0} 
L 3 ( s )=\-A T ,NC T ,-B T ,0} 
L i (s)=[-A T ,NL B,B T ,I} . 

Step 4. Select Y in RH^ with ||y||oo<l (for example Y =0) and set 
X =R -{L.Y+L^L^+L^ 1 . 



Exercise 1. Derive the formulas for L, in Step 3 using the equations 
L = GG l , 



G(s)=[A,[0 B], 
and (7.5.6). 



,1], 



Example 1. 

Let 

R(s) 



.5 

s-1 

1 





2 



2 -s +1 «-l 

We computed in Section 5.2 that dist(i? , RH 00 )=1.2695. Scale i? by, say, 
1/1.28, i.e. redefine R to be 



R(s) = 



1 


.5 

s-1 





1.28 


1 


2 




s 2 -s +1 


s-1 
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The algorithm applied to this matrix has the following results: 



Step 1. 



R [s ) = \A , B , C , D 



A 



C 



2 


-2 10 




1 





1 


, B = 





1 





-.5 .5 
1 -1 1 



7813 




















1.563 



D = 



Step 2. 



N = 



-.1017 











.2035 








1.221 



.2035 

.1017 

-.1017 





.6250 -1.125 .6250 ^.3333 

-1.125 2.625 -1.625 1 

.6250 -1.625 1.125 -.6667 

-.3333 1 -.6667 .5 

1.507 -5.447 5.093 -29.27 

-1.253 15.69 -13.83 79.37 

.7918 -9.640 10.11 -52.14 

-.5339 6.614 -6.251 36.90 



Step 4. Take Y =0. Then 

X{8) = R{ 8 )-L 2 {s)L 4 (s)- 1 

= \A,B,C ,0]-\A,N T B, C,0}X{~A T ,NL B,B T ,1}^ 



A 

A -N T BB T 

-A T -NL n BB T 





B 


? 


N T B 




NL B 



\C -C 0] , 
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The resulting X(s ) is 

-4.315(5 +2.473)(s 2 +1.206s +.3951) 10.47(s +1.054)(s +1) 2 



(s +84.53)(s +1.101)(s 2 +1.381s +1.110) {s +84.53)(s +1.101)(s 2 +1.381s +1.110) 
31.10(s +1.086)(s 2 +1.118s +.7120) -76.19(s 2 +1.279s +.9036) 



(s +84.53)(s +1.10l)(s 2 +1.381s +1.110) {s +84.53)(s 2 +1.381s +1.110) 



Exercise 2. Take 

r 1 («) = -^— , t 2 {b) = -^-, r 3 ( 8 ) = i. 

First, compute a using the method of Section 6.2. Second, choose 7>a (keep 7 
variable) and define R as in Theorem 8.1.1. Third, using the previous algorithm 
find an X in RH^ such that HiJ-XH^ < 1. Fourth, get Q from Theorem 
8.1.1. This Q depends on 7. Finally, in the coefficients of Q let 7 tend to a. 
You should get the optimal Q for the original model-matching problem. 



8.4 Summary: Solution of the Standard Problem 

We have completed our solution of the standard problem posed in Chapter 
3. The first step is to reduce the standard problem to a model-matching prob- 
lem; state-space tools for this reduction are given in Section 4.5. In the single- 
input, single-output case the model-matching problem is relatively easy to solve 
(Section 6.2). In the multi-input, multi-output case the model-matching problem 
is reduced to the Nehari problem (Section 8.1). Finally, the Nehari problem is 
solved in Section 8.3. 



Notes and References 

The first multivariable H^ problem to be solved was the disturbance 
attenuation problem (equivalently, the weighted sensitivity problem): Chang and 
Pearson (1984) used matrix interpolation theory, Francis, Helton, and Zames 
(1984) used the geometric Ball-Helton theory, and Safonov and Verma (1985) 
used operator theory. 
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The approach of Section 1, called ^-iteration, is due to Doyle (1984); Francis 
(1983) independently developed a similar approach for the case T 3 = I . A 
detailed treatment of ^-iteration is contained in Chu, Doyle, and Lee (1986). 

For a comprehensive treatment of Krein spaces see Bognar (1974). The 
approach in Section 3 to the Nehari problem is due to Ball and Helton (1983) and 
Ball and Ran (1986); the proofs are modifications of those in Francis et al. (1984). 
An alternative state-space approach to the Nehari problem (which inspired the 
work of Ball and Ran) is that of Glover (1984). 

An alternative approach to the model-matching problem is that of Kwaker- 
naak (1985, 1986). 

In the scalar-valued model-matching problem the value of a can be com- 
puted directly: it's the norm of a certain Hankel operator. The method presented 
in this chapter for computing a in the matrix-valued case is iterative: a isn't 
equal to the norm of a Hankel operator except in the very special case where T 2 
and T 3 are both square and nonsingular. Feintuch and Francis (1986) showed 
that a equals the norm of a certain (non-Hankel) operator. An alternative for- 
mula has been derived by Young (1986b). This latter formula is simple enough 
to state here. Define two subspaces X and Y of L 2 , 

X := T 3 H 2 

:={/ GL 2 : T 3 f £H 2 } 

Y := orthogonal complement of T 2 H 2 in L 2 . 

Now define the operator S from X to Y as follows: 

3/ := orthogonal projection of T 1 f onto Y , / £X . 

Then Young's formula is a = ||3||. Along these lines, an alternative approach to 
computing a is that of Jonckheere and Juang (1986). 
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PERFORMANCE BOUNDS 



For some simple examples of the standard problem it's possible to obtain 
useful bounds on achievable performance, sometimes even to characterize achiev- 
able performance exactly. This brief chapter presents three illustrative examples. 

Figure 1 shows a feedback system with a disturbance signal w referred to 
the output of the plant P . As usual, P is strictly proper and K is proper. The 
transfer matrix from w to y is the sensitivity matrix S := (I -PK ) _1 . 

Suppose first that the spectrum of w is confined to a pre-specified interval of 
frequencies [0,0^], Wj>0. Then the problem of attenuating the effect of w on the 
output y of the plant is equivalent to that of making ||S(y w)|| uniformly small 
on the interval [O,^]. As in Section 6.3 introduce the characteristic function 

x(jw) := 1 if | w | <w l 

:= if | w | >Wj . 

Then the maximum value of ||5(jw)|| over the interval [0,0^] equals the L^,- 
norm of \S . It may happen that as we try to make \\xS H^ smaller and smaller, 
the global bound \\S H^ becomes larger and larger. This is unpleasant because a 
large value of \\S [^ means the system has poor stability margin. Think of the 
scalar-valued case: if US' Hoc is large, then | l~{PK){j w) | is small at some fre- 
quency, i.e. the Nyquist plot of PK passes near the critical point s =1. 

The first result says that if P is minimum phase (in a certain sense), then 
\\xS Hoo can be made as small as desired while US' H^ is simultaneously main- 
tained less than any bound 6. Of course, 8 must be greater than 1 since 
\\& lloo^l f° r eve ry stabilizing K . 
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Figure 9.1. Disturbance rejection 
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Theorem 1. If P has a right-inverse which is analytic in Re s >0, then for 
every e>0 and <S>1 there exists a stabilizing K such that 



11x5 | 



< e 



< 6 . 



Proof. The idea is to approximately invert P over the frequency range [0, wj 
while rolling off fast enough at higher frequencies. The first step is to 
parametrize all stabilizing K 's as in Section 4.4. Bring in a doubly-coprime fac- 
torization of P : 

P = NM' 1 = M~ l N 



X ~Y 

-N M 



M Y 

N X 



I . 



Then the formula for K is (Theorem 4.4.1) 
K ={Y~MQ){X-NQY l 
= {X-Q~N)- l {Y-Q~M) 

Q e RH^ 

With these two representations of P and K and using (2) we get 
S = (X-NQ )M . 

Now fix e>0 and <5>1. Choose c >0 so small that 
c||XM|| 0O <min(e, 6) 
(1+c) 2 <6. 
It follows from (2) that 
XM - NY = / . 
Since P is strictly proper, so is ./V . Hence 

X(oo)M(oo) = I , 
so that 

||X(oo)M(oo)|| = l. 



(1) 
(2) 

(3) 



(4) 

(5) 
(6) 



C'h. 9 



13. r > 




Figure 9.2. For proof of Theorem 1 
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Since ||X(j w)M(j w)|| is a continuous function of w, it's possible to choose 
w 2 ^w 1 such that 

\\X (j u)M {j u>)\\ < 1 + e , w>w 2 . (7) 

Bring in another characteristic function, 

X2(i w ) == 1 , I w I < w 2 
:= , | w | > w 2 , 

so that (7) is equivalent to 

IKl-X^XAflL^ l + c . (8) 

The assumption on P implies that N has a right-inverse, say iV rt - , which is 
stable (but not proper). Choose a scalar-valued function V in RH^ with the fol- 
lowing three properties: 

VN n is proper 

■ itau-niL < c (») 

||i-v|L<i + c • ( 10 ) 

The idea behind the choice of V can be explained by the picture of the complex 
plane in Figure 2. The Nyquist plot of V should lie in the smaller disk up to fre- 
quency w 2 (inequality (9)), and in the larger disk thereafter (inequality (10)). In 
addition, V should roll off fast enough so that VN H is proper. For example, V 
could take the form 

V{s)= l — r 

(TS+lf 

for large enough r and k . 
Finally, take Q to be 
Q := VN ri X . 
Substitution into (4) gives 

S = {X-VNN ri X)M 
= (l-V)XM . 
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Thus 

IIX-SHoo < ||X2'S'||oo sincew 2> w l 

= \\x 2 {i-v)xm\\ O0 

< 11X2(1-^)110011^1^ 

< c||XM|| 00 from(9) 

< min(£, S) from (5) 
and 

ll(l-X2)S||oo = ll(l-X 2 )(l-^)XM|| 00 

<l|l-^llooll(l-X 2 )-Of||oo 

< (1 + c ) 2 from (8) and (10) 

< 6 from (6) . 
The two inequalities 

llx2^IL<«5 

ll(l-X 2 )S||oo<<5 
imply||5|| 00 <^.a 

On the other hand, if P has a zero in the right half-plane, then ||S K^ must 
necessarily increase without limit if \\xS W^ tends to zero. This might be 
described as the "waterbed effect". 

Theorem 2. Assume there is a point s in Re s >0 such that the rank of P (s ) 
is less than the number of its rows. Then there exists a positive real number a 
such that for every stabilizing K 

HxS||ooPlloo>l- 

Proof. We continue with the notation of the previous proof. Fix some stabiliz- 
ing K . Then S must have the form 
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S = {X-NQ)M . (4bis) 

By assumption, there is a point s in Re s >0 and a nonzero complex vector £ 
such that 

x'P{s ) = 0. (11) 

Scale x so that a; x =1. 
It is claimed that 

x ' S (s ) = x * . (12) 

To see this, first get from (2) that 

MX -NY = I . 
Pre-multiply by M" 1 , post-multiply by M , and use (1) to get 

XM - PYM = / . 
Therefore from (11) 

x* X{s )M{s ) = x* . (13) 

Similarly, (11) also implies that 

x*N(s ) = 0. (14) 

Then (13) and (14) imply (12). 

Map the right half-plane onto the unit disk via the mapping 
s ~s 



s +s 
s o~ s o z 



1 + 2 

The point s==s is mapped to the origin, z =0, and the interval [0,ywj] is 
mapped onto an arc 

{e i$ :e x <e<9 % } . (15) 

Let (j) be the angle subtended by this arc, i.e. (j>= \ O 2 -0 1 | , and define 

R(z):=S[(8 a -7 z)/{l+z)}. (16) 

Then R is rational and analytic in the closed unit disk. Moreover 
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maxjp M*)j|rfi<*<' 2 } = 11x5 |L (17) 

and 

max{p (e ^)||:0<<K27r} = ||5 IL . (18) 

Now let n be any integer greater than 2n/<j> and define 

T{z):= R{z )R {ze > 2lT / n )...R {ze > 2 < n ~ l V n ) . (19) 

Then T is rational and analytic in the closed unit disk. Since the angle 2n/n is 
less than <j>, at least one of the n points 

«>«»*/» , jfc=o,...,n-l 

lies in the arc (15) for each z on the unit circle. Thus from (19) 

c :=max{||r(e'*)||:O<0<27r} 

< [max{|| J R(e^)||:0<^<27r}]"- 1 

X [max{p(e^)IK<0<0 2 }], 
or from (17) and (18) 

c ^PIUMIU, (20) 

where a :=n-l. Notice that n depends only on (f>, which in turn depends only/ 
on s and uj 1 . Thus a is independent of K . It remains to show that c >1. 

Now (12) and (16) imply 

x * R (0) = x * . 
This and (19) yield 

x ( T (0) = x * . 
Thus 

x* T(0)x = x ' x = 1 . (21) 

But x T(z)x is analytic in the closed unit disk. So (21) '. and the maximum 
modulus theorem imply that 

| x* T{e l6 ")x | > 1 (22) 

for some 6 . Now 
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c 2 = max max y * T (e > e )T (e > *)* y 
9 y'y=l 

>x'r(e'V(^'Y^ • (23) 

Define the vector 

* :=r( e ''V* • 

Then 

1 < | x x | from (22) 

<0r o '*o) 1/2 (*'*) 1/2 

= (*o*o) 1/2 

< c from (23) . D 



For the third result consider, again with respect to Figure 1, the problem of 
attenuating the effect of w (no longer restricted to be bandlimited) on the control 
signal u ; that is, the problem is to achieve feedback stability by a controller 
which limits as much as possible the control effort. The transfer matrix from w 
to u equals KS , so the objective is to minimize H^SH^,. The case where P is 
stable is trivial: an optimal K is K =0. So we suppose K is not stable. For 
technical reasons it is assumed that P has no poles on the imaginary axis; thus 
P belongs to RL^ but not RH^. 

Bring in F P , the Hankel operator with symbol P . Let o- m ; n (r P ) denote its 
smallest singular value, i.e. the square root of the smallest nonzero eigenvalue of 
FpTp . This number can be easily computed via Theorem 5.1.3. 

Theorem 3. If P belongs to RL^ but not RH^, then the minimum value of 
\\ KS I loo over all stabilizing K 's equals the reciprocal of a mia (T P ). 

Proof. The proof is an interesting application of operator theory; some of the 
details are left as exercises. 

Again, we continue with the notation introduced in the proof of Theorem 1. 
From (1) and (2) we have 
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P = NM~ l 

XM -YN =1 , 

so that 

X - YP = M" 1 . 

Thus M" 1 £RL 00 . It follows from Theorem 7.4.1 that M has an inner-outer fac- 
torization: 

M = M { M , 

M t inner , 

M .J^-^RH^ . 

Thus P = (ArM _1 )M, _1 . Hence we may as well assume from the start that M is 
inner; similarly that M is co-inner. 

From (3) and (4) we have 
KS = {Y-MQ)M . 
Hence the minimum value of \\KS U^ over all stabilizing K 's equals 
mm{\\{Y-MQ)M\\ 0O :QeRBi O0 } 
= mm{\\Y-MQ\\ 00 :QeBM 00 } 
= dist(M- 1 r,RH 00 ) 
= dist(M _1 Y ,11^) by Lemma 8.1.3 
= ||r^ || by Nehari's theorem, 
where 

R := M~ l Y . (24) 

So proving the theorem is equivalent to showing that 

Wr\\ = {^JXp)Y 1 ■ (25) 

It would simplify matters if the two Hankel operators in (25) were surjective, 
but they're not: they're finite rank (Theorem 5.1.1). To make them surjective, 
we'll restrict their co-domains. 
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The image of T P is IIjP H 2 . This subspace of H/ is finite-dimensional, 
hence closed. Let's redefine T P to be 

T P :H 2 -> n x P H 2 

T P f ==11^/ . 
Now P =NM~ 1 and M~ l is the symbol of the Hankel operator 

r M -i:H 2 - n^^H, . 

Introduce a new operator, 

n N •ii 1 M- l s. 2 -+ njF h 2 
n N f -.= 11^/ . 

Exercise 1. Verify that 

r P =n ff r rl . (26) 

It follows from (2) that MY = YM . Thus R = YM~\ 

Exercise 2. Use the fact that M is inner to prove 

Tm-Jm-i = I ■ (27) 

Similarly 

T M lT k-i = I ■ ( 28 ) 

It follows from (2) that 
XM ~ NY = I 
XM -YN =1 . 

Exercise 3. Use these two equations to prove 

lijy \ly = — 1 , ily ilj\[ = —1 • 
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These two equations imply that 

n y rty = (n^Ojv) . (29) 

From (26) and (27) we have 

TpTp = n N Q^ . (30) 

Similarly 

TpT^nyQy. (31) 

Thus 

!|r«r«|| = ||n y ny|| from (31) 

= ||(n^n JV )- 1 ||from(29) 

= ||(n iV n;r 1 || 

= ||(r P r;r 1 ||from(30). 

This proves (25) since the largest eigenvalue of (T P T p) -1 equals the reciprocal of 
the smallest eigenvalue of T P Tp. U 

Example 1. 

The simplest possible example to illustrate Theorem 3 is 

P(0 = - J T - 

s -1 

Then 

* m m(rp) = ||r P || = i/2. 

Let's stabilize with a constant controller, K (s )=-fc . Then 

(KS)(s) = -k ~^~. 
s +k-l 

Clearly K stabilizes P iff k >1. We compute that 
\\KSWn = k if k>2 

= -*-if l<k<2 . 
k -1 
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Thus \\KS | log is minimized by the gain k =2. 

Notes and References 

The idea of Theorem 1 is due to Bensoussan and Zames: Bensoussan (1984) 
proved the theorem in the scalar-valued case with P stable, and Zames and Ben- 
soussan (1983) proved a result like Theorem 1 but where P was assumed to be 
diagonally dominant at high frequency and K was required to be diagonal. The 
proof given here is from Francis (1983). Theorem 2 is the multivariable generali- 
zation of a result of Francis and Zames (1984). Theorem 3 is due independently 
to Glover (1986), whose proof uses state-space methods, in contrast to the 
operator-theoretic one here, and to Verma (1985). Actually, their result pertains 
to the mathematically equivalent problem of robust stabilization: the largest 
radius of plant uncertainty for robust stabilizability equals the reciprocal of the 
smallest Hankel singular value of the nominal plant. 

For other results on achievable performance see, for example, Boyd and 
Desoer (1984), Freudenberg and Looze (1985), and O'Young and Francis (1985, 
1986). 
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